69 Commits (1e75a2df21b8fc6c19444a6660fcd67c793f45a2)

Author SHA1 Message Date
  nihui 556b79ce4d
create layer decoupled (#5258) 2 years ago
  nihui f1ea792b26
fix too many microtask error in old libomp runtime (#4002) 3 years ago
  Evgeny Proydakov 86a785c4aa
Fixed linux-gcc noint8t build: (#3888) 4 years ago
  nihui 131f3d1323
x86 avx512 optimization for convolution winograd pack16to1 and deconvolution family, increase simpleomp argv count (#3694) 4 years ago
  nihui c0a94cd9ca
fix armv7 without neon (#3514) 4 years ago
  nihui 999e640d43
dynamic convolution weight (#3408) 4 years ago
  zhiliu6 814f89ef1a
Fuse HardSwish activation into Convolution and InnerProduct (#3233) 4 years ago
  nihui 7e1aaa5828
cmake option NCNN_INT8 (#2839) 5 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  Leo 5afd318b86
Support remove libstdc++ denpendency (#2030) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 6 years ago
  nihui b5e288b521 layer creator function is not necessary for built-in layers 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  zhiliu6 3bfabf1d6a
Add fused convolution and mish layer support. (#1761) 6 years ago
  Naiyang Lin ceef2470a5
Add logger.h (#1753) 6 years ago
  nihui e14716dfef convolution and pooling make padding helper, flatten innerproduct pooling bf16s neon 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  Sungmann Cho 9bfc554bc9 Fix warnings on Visual Studio (#1431) 6 years ago
  nihuini 336d1c1edd remove the ncnn namespace for in source Option 6 years ago
  nihuini 567e2bd501 a dirty hack for resolving int8 pack4 crash 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihuini c0a4ffcf66 convolution pad_value param 6 years ago
  Xu Yang 31cf7f3c5b fix ConvolutionDepthWise int8_requantize (#1233) 6 years ago
  nihuini 296e0022df deconvolution output adj and output shape 6 years ago
  nihuini 0e26e3094e autopad SAME_LOWER 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 528fe8e9e3 gpu convolution/deconvolution/innerproduct fuse activation 7 years ago
  nihuini 3f85cafc08 fuse relu leakyrelu clip into convolution/deconvolution/innerproduct 7 years ago
  nihui 274392eb80 convolution padding same on gpu 7 years ago
  BUG1989 780c7d9a72 merge de/requantize op, optimize some int8 conv layer on arm64-v8a (#867) 7 years ago
  nihuini b2e41bf83d fallback convolution to cpu path for pad -233 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  BUG1989 8e337d440e fix the bug with convdw7x7 op working on int8 mode (#818) 7 years ago
  BUG1989 8ff831f7cd fix the segmentation fault when load int8 model (#811) 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihuini 43c4b57201 group deconvolution packing family 7 years ago
  nihuini 8547864b6f group convolution packing family 7 years ago
  nihuini 37413ea95c implement depthwise deconvolution vulkan, fix top blob state 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihui 960ffa1a50 optimize workgroup size for convolution depthwise and innerproduct pack4 7 years ago
  nihui ad68e1e0e6 enable googlenet alexnet vulkan benchmark, fix build on msvc 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  nihuini ef36d79b7e implement the missing dequantize image on armv7, prefer neon-optimized 3-dim dequantize, fix #547 7 years ago