63 Commits (40a69a2dd3aad1ce6c776a1d178cb2c6d2fffe85)

Author SHA1 Message Date
  nihui 40a69a2dd3
discard riscv weight memory (#3874) 4 years ago
  nihui 241524ffce
discard weight memory for x86 arm vulkan (#3865) 4 years ago
  Yoh bf1109c10b
fix elempack16 bug (#3728) 4 years ago
  nihui dadc640c66
x86 avx512 optimization (#3581) 4 years ago
  nihui 920aa79f04
drop x86 avx2 fp16 (#3568) 4 years ago
  nihui de77b669c4
x86 sse2 optimization for conv1x1/3x3 pack4 and general sgemm pack4/pack4to1 (#3538) 4 years ago
  nihui 6941ec8fc9
arm neon optimization for general packed convolution (#3426) 4 years ago
  nihui 999e640d43
dynamic convolution weight (#3408) 4 years ago
  nihui 24fbb6e8cb
honor thread setting on load and vulkan command, ci avx512 t4 (#3391) 4 years ago
  Tijmen Verhulsdonck ac5dc23ccc
added a number of optimized sse layers (#3302) 4 years ago
  zhiliu6 a08f700775
Optimize avx convolution activation (#3299) 4 years ago
  zhiliu6 814f89ef1a
Fuse HardSwish activation into Convolution and InnerProduct (#3233) 4 years ago
  Tijmen Verhulsdonck 4270b5c502
Fix broken codepaths with AVX only (#3254) 4 years ago
  Tijmen Verhulsdonck eaa7e24db6
Added ability to switch AVX/AVX2 during runtime (#3076) 5 years ago
  Evgeny Proydakov 9245cdca42
Fixed compile warnings for clang compiler on MacOS. [-Wunused-parameter] (#2998) 5 years ago
  nihuini 687cc857b1 some x86 sse2 optimization for convolution int8 5 years ago
  nihui 7e1aaa5828
cmake option NCNN_INT8 (#2839) 5 years ago
  nihuini 256754bff9 fix build with old gcc, fix #2805 5 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  nihuini 1a3191e245 fix libncnn build with gcc-4.8 and gcc-4.4, fix #2388 5 years ago
  nihui a071637064
optional sse2 (#2373) 5 years ago
  PENGUINLIONG 8f8f2de4d0
SSE2 optimization pack (#2123) 5 years ago
  Evgeny Proydakov 7dfb1cbafc
Fixed compile warning for x86 layer with reason: [-Wunused-variable] (#2186) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 6 years ago
  nihui b5e288b521 layer creator function is not necessary for built-in layers 6 years ago
  Tijmen Verhulsdonck 66618340ac
x86 fp16 weight storage optimizations (#1871) 6 years ago
  Tijmen Verhulsdonck 988e8088ea
Fix benchmark (#1864) 6 years ago
  nihui 01b8b79ed2 packing layout option respect support_packing property 6 years ago
  Tijmen Verhulsdonck d1b5711791
X86 Elempack 8 AVX implementations. (#1853) 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  zhiliu6 63d7e2c88d
Add support for darknet EfficientNetB0-Yolov3 conversion. (#1821) 6 years ago
  zhiliu6 3bfabf1d6a
Add fused convolution and mish layer support. (#1761) 6 years ago
  nihui e14716dfef convolution and pooling make padding helper, flatten innerproduct pooling bf16s neon 6 years ago
  xieydd b760e22da2
fix requant relu6 bug (#1590) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihuini 336d1c1edd remove the ncnn namespace for in source Option 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihuini c0a4ffcf66 convolution pad_value param 6 years ago
  nihuini e4b44d293e more autopad SAME_LOWER 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 7 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 3f85cafc08 fuse relu leakyrelu clip into convolution/deconvolution/innerproduct 7 years ago
  BUG1989 780c7d9a72 merge de/requantize op, optimize some int8 conv layer on arm64-v8a (#867) 7 years ago
  BUG1989 8e337d440e fix the bug with convdw7x7 op working on int8 mode (#818) 7 years ago
  BUG1989 8ff831f7cd fix the segmentation fault when load int8 model (#811) 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago