You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
..
arm fix compile on ios simulator (#756) 7 years ago
shader fix name conflict on ios 7 years ago
x86 fix the dequantize arm bug (#580) 7 years ago
absval.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
absval.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
argmax.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
argmax.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
batchnorm.cpp reduce memory usage of weight packing 7 years ago
batchnorm.h reduce memory usage of weight packing 7 years ago
bias.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
bias.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
binaryop.cpp massive simd optimize in compute shader (#772) 7 years ago
binaryop.h massive simd optimize in compute shader (#772) 7 years ago
bnll.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
bnll.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
clip.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
clip.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
concat.cpp add transfer-transfer barrier, concat pack4 7 years ago
concat.h [WIP] vulkan compute (#618) 7 years ago
convolution.cpp reduce memory usage of weight packing 7 years ago
convolution.h reduce memory usage of weight packing 7 years ago
convolutiondepthwise.cpp reduce memory usage of weight packing 7 years ago
convolutiondepthwise.h reduce memory usage of weight packing 7 years ago
crop.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
crop.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
deconvolution.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
deconvolution.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
deconvolutiondepthwise.cpp improve group deconvolution openmp scheduler 7 years ago
deconvolutiondepthwise.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
dequantize.cpp layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
dequantize.h core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
detectionoutput.cpp fix caffe ssd, somewhat ugly though ... 7 years ago
detectionoutput.h mxnet-ssd wip ... 7 years ago
dropout.cpp massive simd optimize in compute shader (#772) 7 years ago
dropout.h massive simd optimize in compute shader (#772) 7 years ago
eltwise.cpp massive simd optimize in compute shader (#772) 7 years ago
eltwise.h massive simd optimize in compute shader (#772) 7 years ago
elu.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
elu.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
embed.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
embed.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
exp.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
exp.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
expanddims.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
expanddims.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
flatten.cpp flatten pack1to4 7 years ago
flatten.h add flatten pack4 7 years ago
innerproduct.cpp reduce memory usage of weight packing 7 years ago
innerproduct.h reduce memory usage of weight packing 7 years ago
input.cpp [WIP] vulkan compute (#618) 7 years ago
input.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
instancenorm.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
instancenorm.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
interp.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
interp.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
log.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
log.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
lrn.cpp lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
lrn.h lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
lstm.cpp fixed cell initialization in LSTM layer 7 years ago
lstm.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
memorydata.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
memorydata.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
mvn.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
mvn.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
normalize.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
normalize.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
packing.cpp packing shader for 3-dim blob 7 years ago
packing.h packing shader for 3-dim blob 7 years ago
padding.cpp massive simd optimize in compute shader (#772) 7 years ago
padding.h massive simd optimize in compute shader (#772) 7 years ago
permute.cpp mxnet-ssd wip ... 7 years ago
permute.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
pooling.cpp pooling full padding 7 years ago
pooling.h massive simd optimize in compute shader (#772) 7 years ago
power.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
power.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
prelu.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
prelu.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
priorbox.cpp mxnet-ssd done 7 years ago
priorbox.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
proposal.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
proposal.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
psroipooling.cpp implement psroipooling layer 7 years ago
psroipooling.h implement psroipooling layer 7 years ago
quantize.cpp core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
quantize.h core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
reduction.cpp fix reduction dim 2, fix #593 7 years ago
reduction.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
relu.cpp massive simd optimize in compute shader (#772) 7 years ago
relu.h massive simd optimize in compute shader (#772) 7 years ago
reorg.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
reorg.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
reshape.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
reshape.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
rnn.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
rnn.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
roialign.cpp trival fix for the last element 7 years ago
roialign.h implement roialign layer 7 years ago
roipooling.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
roipooling.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
scale.cpp reduce memory usage of weight packing 7 years ago
scale.h reduce memory usage of weight packing 7 years ago
shufflechannel.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
shufflechannel.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
sigmoid.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
sigmoid.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
slice.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
slice.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
softmax.cpp lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
softmax.h lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
split.cpp [WIP] vulkan compute (#618) 7 years ago
split.h [WIP] vulkan compute (#618) 7 years ago
spp.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
spp.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
squeeze.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
squeeze.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
tanh.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
tanh.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
threshold.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
threshold.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
tile.cpp implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
tile.h implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
unaryop.cpp massive simd optimize in compute shader (#772) 7 years ago
unaryop.h massive simd optimize in compute shader (#772) 7 years ago
yolodetectionoutput.cpp implement multiscale yolov2, update example model comment 7 years ago
yolodetectionoutput.h implement multiscale yolov2, update example model comment 7 years ago
yolov3detectionoutput.cpp Increase a few performance of yolov3 and change tab to space (#767) 7 years ago
yolov3detectionoutput.h Support yolov3 and mobilenet yolov3 7 years ago