nihuini
|
ef36d79b7e
|
implement the missing dequantize image on armv7, prefer neon-optimized 3-dim dequantize, fix #547
|
7 years ago |
nihuini
|
23de61fd07
|
as we already have the int8_scale_term switch, do not have to rely on the actual scale value
|
7 years ago |
nihuini
|
6f1b0b0a61
|
quantized padding in convolution, use range sweets
|
7 years ago |
nihuini
|
2dbaf6f7b7
|
store int8 scale in binary
|
7 years ago |
nihui
|
fe14037777
|
more sub op preload
|
7 years ago |
nihui
|
2fe7ada4d8
|
add arm int8 convolution stub, preload group op for x86
|
7 years ago |
nihui
|
5d04a3a45c
|
layer holds bottom blob scale, depthwise convolution read group scales
|
7 years ago |
nihui
|
a169cec363
|
core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table
|
7 years ago |
nihui
|
9706cd1447
|
implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469
|
8 years ago |
nihuini
|
0ce0c11851
|
load sub-op in advance for group convolution
|
8 years ago |
Tiancai Ye
|
3977d32eb9
|
Fix windows build fails (#321)
* fix windows build error
* remove wrong commit
|
8 years ago |
nihuini
|
76a55693a6
|
decouple convolutiondepthwise and convolution, reduce binary size by 10%, fix #254
|
8 years ago |
nihuini
|
a84ba8fc0f
|
element type storage support in Mat, move data member the first so that a pointer to Mat is a pointer to data, convenient index access for float vector
|
8 years ago |
nihui
|
a181d25098
|
new model load api, fix #215
|
8 years ago |
nihui
|
bdb70a2010
|
padding w h in convolution and deconvolution
|
8 years ago |
nihui
|
44b4519307
|
non-square convolution and deconvolution kernel stride dilation
|
8 years ago |
nihui
|
1e2265dd99
|
new param load api
|
8 years ago |
nihuini
|
47218db6e5
|
fix minus padding SAME, fix #116
|
8 years ago |
nihuini
|
b4e3615ee4
|
depth-wise optimize
|
8 years ago |
nihuini
|
9f51c21322
|
implement depth-wise convolution
|
8 years ago |