nihuini
26dc9820e4
custom mlir ncnn optimize pass, add efficientnetv2_b0 benchmark
5 years ago
nihuini
b7959a4fc7
fuse mlir swish and conv-addv2
5 years ago
nihuini
e34edfb672
fix caffe2ncnn input shape for 1-dim and 2-dim blob
5 years ago
nihuini
34bd5ef161
update eq quant info
5 years ago
nihuini
72ef77a469
fix build with NCNN_STRING off and NCNN_VULKAN on
5 years ago
nihuini
d021079cd3
implement ncnn2table eq method
5 years ago
nihui
7d10e5593b
fix build
5 years ago
nihuini
45e548e342
rewrite ncnn2table tool, with better multithreading processing
5 years ago
nihuini
f9e42d4ee4
drop fp16 and int8 mode from caffe2ncnn, ncnnoptimize and ncnn2int8 should be used instead
5 years ago
nihuini
d095149f59
workaround mxnet op non-unique name issue, fix #2855
5 years ago
nihuini
79927a9eeb
update quantized int8 inference doc
5 years ago
nihuini
31d436c627
more verbose load failure, ncnn2int8 write int8 data properly
5 years ago
nihuini
56d74580a0
fix uninitialized input and weight
5 years ago
nihuini
97903bbe85
fix alerts
5 years ago
nihuini
2b1f8bea25
fix float locale issue in ncnn2mem, fix long line parsing in ncnn2int8
5 years ago
nihui
7e1aaa5828
cmake option NCNN_INT8 ( #2839 )
5 years ago
nihuini
6c41822d35
do not remove optional hidden output for lstm/gru/rnn
5 years ago
nihuini
5e85f447e6
convert mxnet channel padding, fix some lgtm warnings
5 years ago
nihui
5fe75f19ef
architecture changes for int8 packing ( #2771 )
* quantize and dequantize tests
* unify activation and usability function
* drop NCNN_REQUANT cmake option, test dequantize requantize pack8, fix webassembly build
* benchmark use requantize int8 model
5 years ago
nihui
d4a7abc218
fix onnx2ncnn clip without max blob, fix #2788
5 years ago
restyled-io[bot]
5f00ba89d2
feat(ncnnoptimize): replace denormals to zero on layers with weights ( #2690 )
* feat(ncnnoptimize): replace denormals to zero on layers with weights
Co-authored-by: youngsoo.lee <youngsoo15.lee@gmail.com>
Co-authored-by: Restyled.io <commits@restyled.io>
5 years ago
nihui
19c183cacc
fix ncnnoptimize default cut range
5 years ago
restyled-io[bot]
5e565cfa8a
Restyle add net Cut Function ( #2763 )
* add ncnnoptimise cut net function
* Restyled by clang-format
* Restyled by astyle
Co-authored-by: chenty <admin@chenty.com>
Co-authored-by: Restyled.io <commits@restyled.io>
5 years ago
nihuini
15d63ec0f5
fuse onnx multiheadattention with same qkv blob
5 years ago
nihuini
e449435dbe
fix mlir2ncnn warning, prettier alignment :)
5 years ago
nihuini
c8ccccf045
adapt mlir changes
5 years ago
nihuini
c17eb4e208
multiheadattention layer
5 years ago
nihuini
b0d16325b1
fuse onnx binaryop with scalar
5 years ago
nihuini
f7cbcaa72b
fix onnx normalize expand ghost shape
5 years ago
nihuini
c910574b5b
fuse onnx multiheadattention
5 years ago
nihuini
f2a5ea7678
fix layernorm ghost input without affine
5 years ago
nihuini
7ac23ab34d
fuse onnx layernorm, fix 2-dim layernorm implementation, add test
5 years ago
nihui
d3d16d2413
fix ncnnoptimize crash on models with multiple custom layers
5 years ago
nihuini
a1839f6bce
fix ncnnoptimize shufflechannel reverse mode
5 years ago
nihuini
5b72a37544
fix megvii style shufflechannel blob count mismatch
5 years ago
nihuini
f47fbcbf83
preserve the very first onnx constant dimension size when we guess it is not the batchsize one, fix #2487
5 years ago
nihuini
0e79532975
topological sort onnx nodes, convert onnxruntime bert ops
5 years ago
Guoxia Wang
f3c48cd063
Fix bug of adaptive pool in int8 quantization model ( #2637 )
5 years ago
daquexian
f684dc236b
allow set llvm/mlir dirs from cli ( #2624 )
5 years ago
Zhuo Zhang
30a4482506
fix typo in comment in ncnn2int8.cpp ( #2627 )
5 years ago
nihuini
6550a517ef
adapt mlir api changes, update tf dialect
5 years ago
nihui
b01728a3be
fix crash when eliminating memory footprint on undefined blobs, fix #2609
5 years ago
nihuini
11c569fc52
fix in-between empty input node
5 years ago
nihuini
5bd07030ab
remove onnx expand shape node
5 years ago
nihui
4f2907d129
arm neon optimization for rnn ( #2601 )
5 years ago
nihuini
6572e98f66
handle onnx transpose 102 201 after lstm
5 years ago
nihui
ef3550b52f
gru and rnn layer ( #2572 )
5 years ago
nihuini
cf853bd3ce
convert onnx gru rnn graph
5 years ago
Guoxia Wang
13d0bfcfd0
fix onnx2ncnn adaptive pool conversion bug, remove no used weights node ( #2570 )
5 years ago
nihuini
8790de3507
better onnx lstm fusion
5 years ago