40 Commits (4d2d625432e8fdaaaa33042f31ceb6071eef6809)

Author SHA1 Message Date
  nihui 11cffce114 armv8.2 infrastructure (#1856) 5 years ago
  nihui b5e288b521 layer creator function is not necessary for built-in layers 5 years ago
  Tijmen Verhulsdonck 66618340ac x86 fp16 weight storage optimizations (#1871) 5 years ago
  Tijmen Verhulsdonck 988e8088ea Fix benchmark (#1864) 6 years ago
  nihui 01b8b79ed2 packing layout option respect support_packing property 6 years ago
  Tijmen Verhulsdonck d1b5711791 X86 Elempack 8 AVX implementations. (#1853) 6 years ago
  nihui 3ef995ed1e format code style and setup restyled.io (#1840) 6 years ago
  zhiliu6 63d7e2c88d Add support for darknet EfficientNetB0-Yolov3 conversion. (#1821) 6 years ago
  zhiliu6 3bfabf1d6a Add fused convolution and mish layer support. (#1761) 6 years ago
  nihui e14716dfef convolution and pooling make padding helper, flatten innerproduct pooling bf16s neon 6 years ago
  xieydd b760e22da2 fix requant relu6 bug (#1590) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihuini 336d1c1edd remove the ncnn namespace for in source Option 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihuini c0a4ffcf66 convolution pad_value param 6 years ago
  nihuini e4b44d293e more autopad SAME_LOWER 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 3f85cafc08 fuse relu leakyrelu clip into convolution/deconvolution/innerproduct 7 years ago
  BUG1989 780c7d9a72 merge de/requantize op, optimize some int8 conv layer on arm64-v8a (#867) 7 years ago
  BUG1989 8e337d440e fix the bug with convdw7x7 op working on int8 mode (#818) 7 years ago
  BUG1989 8ff831f7cd fix the segmentation fault when load int8 model (#811) 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihuini 6f1b0b0a61 quantized padding in convolution, use range sweets 7 years ago
  nihuini 2dbaf6f7b7 store int8 scale in binary 7 years ago
  nihui 2fe7ada4d8 add arm int8 convolution stub, preload group op for x86 7 years ago
  nihui eac7c66a97 fix fp32 group convolution on x86 7 years ago
  nihui 5d04a3a45c layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
  nihuini 6b536701c3 sub-mat shall be allocator-aware 7 years ago
  nihuini 4be27a0a89 int8 inference on x86 7 years ago
  nihui a169cec363 core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihuini 9ac305e160 create 3-dim sub blob for group convolution, fix #315 8 years ago
  nihui 6c4c810fda decouple modelbin of different input types, simplify timestamp function 8 years ago
  nihuini 76a55693a6 decouple convolutiondepthwise and convolution, reduce binary size by 10%, fix #254 8 years ago
  nihuini 03621aa7f9 more x86 stub for convolution and convolutiondepthwise 8 years ago