nihui
9afeb17cd4
ncnn2int8 use fp16 weight type by default ( #5907 )
1 year ago
nihui
66b54cbea2
multiheadattention int8 quantization ( #5733 )
* x86 vulkan fallback
* comment about bf16s
1 year ago
nihui
1c7af00499
gemm int8 quantization ( #5706 )
* quantize gemm
* write gemm quantize scales
* update doc
* less openmp args
* x86 riscv fallback
* skip gemm vulkan int8
* fix noint8 test, fix arm bf16 test
* enable vfpv4 on neon build only
* fix gemm vulkan without C
* fp16 pack8 output
* enable elempack=8 only for asimdhp+
* tiled gemm int8 test
* opt arm64 tiles, fix asimdhp dispatch
1 year ago
nihui
5df5413c81
embed int8 quantization and add embed test ( #5667 )
1 year ago
nihui
08b7d99a75
rnn/lstm/gru dynamic quantization ( #5435 )
2 years ago
teng
e24a2e6c40
[fix] fix typo ( #3377 )
4 years ago
nihuini
1ad37fae1e
generate random weight when null bin passed
5 years ago
nihuini
56d74580a0
fix uninitialized input and weight
5 years ago
nihuini
2b1f8bea25
fix float locale issue in ncnn2mem, fix long line parsing in ncnn2int8
5 years ago
nihui
5fe75f19ef
architecture changes for int8 packing ( #2771 )
* quantize and dequantize tests
* unify activation and usability function
* drop NCNN_REQUANT cmake option, test dequantize requantize pack8, fix webassembly build
* benchmark use requantize int8 model
5 years ago
Guoxia Wang
f3c48cd063
Fix bug of adaptive pool in int8 quantization model ( #2637 )
5 years ago
Zhuo Zhang
30a4482506
fix typo in comment in ncnn2int8.cpp ( #2627 )
5 years ago
nihui
54c0a13b9f
build shared library ( #2525 )
* build shared lib and enable lto
* reserved for layer and option
* allocator pimpl
* datareader pimpl
* paramdict pimpl, disable copy assign for allocator and datareader
* modelbin pimpl
* net extractor pimpl
* gpu pimple
* disable copy assign vulkandevice, code format
* command pimpl, dummy image readonly
* pipeline pipelinecache pimpl, export platform class
* code format, export simple family
* update ci
* disable lto on android armv7, merge webassembly ci
* link libgcc, fix macos dylib version
* pipeline pimpl, gpu info pimpl
* destroy gpu info after vulkan device
* ignore msvc stl class warning
* fix ncnn_paramdict_get_float return type
* fix vktransfer upload fp16 without flatten, add command test
5 years ago
nihui
3ef995ed1e
format code style and setup restyled.io ( #1840 )
6 years ago
JackieWu
ce2251db05
Improve ROIAlign (accelerate ROIAlign, support sampling ratio and aligned ROIAlign) ( #1820 )
* update roialign
* update tool for roialign
* fix ceil for roialign
* fix ios build
* fix
* fix
* make it compatibile with the old version
* fix ios build
* trigger CI
* fix test
* order
* build
* remove code
* merge roialign
* accelerate ROIAlign
* update note
* rename func
* roialign version
* trigger CI
* fix roialign
* use ref for pre-calc in roialign
* retrigger CI
* pre-alloc
6 years ago
nihui
8a84077429
ncnnoptimize lstm
6 years ago
nihui
dba992c15c
ncnnoptimize add pixelshuffle, fix #1608
6 years ago
nihui
05ab6813f5
c++03 compiler happy
6 years ago
kalcohol
6916f45237
fix quantize tools cross platform issues ( #1505 )
* 1. for cross platform, use cv::CommandLineParser instead of getopt.h
2. use cv::glob to search folder instead of dirent.h
3. fix some other warnings of function from cstdlib
4. add some const and some auto
* 1. fix some other warnings of function from cstdlib
2. add some const and some auto
3. i forgot...
* switch quantize tools default on
* check OpenCV first, if not found, disable quantize tools
* add _CRT_SECURE_NO_DEPRECATE for msvc and remove *_s functions
* add _CRT_SECURE_NO_DEPRECATE for msvc and remove *_s functions
* keep the one line style block
* folding bracket
* dynamic_cast -> static_cast
6 years ago
Guoxia Wang
5c08ce7db9
fix Reduction Op attr bug ( #1363 )
* fix onnx2ncnn MemoryData dims problem #1330
* fix ncnn2int8 bug (see #1336 )
* fix bug that MemoryData as multi operations input
* fix ncnn2table find_blob_index_by_name data failed bug, see #1338
* fix ncnn2table find_blob_index_by_name data failed bug, see #1338 , fix change requested
* fix Reduction operation attribution bug
* fix #1363 request changes
6 years ago
Guoxia Wang
31c6cefc44
fix ncnn2int8 bug, see issue #1336 ( #1341 )
* fix onnx2ncnn MemoryData dims problem #1330
* fix ncnn2int8 bug (see #1336 )
6 years ago
daichuanliang
6176ada9f0
update ncnn2int8.cpp ( #1315 )
Fix compile issue with ncnn2int8
6 years ago
MambaWong
f5004f954b
Update ncnn2int8.cpp ( #1200 )
asymmetric padding parameter for convolution and deconvolution family
6 years ago
BUG1989
f79fa87dca
Add the readme file of quantization tools ( #1198 )
6 years ago
BUG1989
bcfe9f453f
initial the ncnn post training quantization tools ( #1067 )
* initial the ncnn post training quantization tools
* clear some comments of tools
* fix the Travis ci compiler error
7 years ago