44 Commits (057b5bb515d551fa64decdb7350422c19feba447)

Author SHA1 Message Date
  nihui 057b5bb515
split tests (#4354) 3 years ago
  nihui 76849cede4
armv8.4 i8mm optimization for convolution gemm int8 (#4034) 3 years ago
  nihui 20a14bf5ae
arm convolution winograd dot function, adjust arm convolution winograd strategy (#3915) 3 years ago
  nihui ca0ba4b25f
fine grained winograd options, adjust x86 convolution winograd strategy (#3908) 3 years ago
  nihui 131f3d1323
x86 avx512 optimization for convolution winograd pack16to1 and deconvolution family, increase simpleomp argv count (#3694) 4 years ago
  nihui dadc640c66
x86 avx512 optimization (#3581) 4 years ago
  nihui 002c07d4ec
mix vulkan winograd f23 and f43 (#3639) 4 years ago
  nihui 6941ec8fc9
arm neon optimization for general packed convolution (#3426) 4 years ago
  nihui 999e640d43
dynamic convolution weight (#3408) 4 years ago
  nihui f10cc6dd93
initial data structure changes for 3dcnn, conv3d, pooling3d (#3378) 4 years ago
  zhiliu6 814f89ef1a
Fuse HardSwish activation into Convolution and InnerProduct (#3233) 4 years ago
  nihuini d6b2ea5aac arm neon optimization for convolution 3x3 on small channels 5 years ago
  nihui 7e1aaa5828
cmake option NCNN_INT8 (#2839) 5 years ago
  nihuini 41a4bea954 unroll size 8 for conv3x3s1 pack8to1 int8 arm64 5 years ago
  nihui e9cc637573
arm neon optimization for int8 packing kernels (#2809) 5 years ago
  nihui 1ea8bfbd2e x86 avx2 conv3x3s1 pack8 direct optimization, fix #2789 5 years ago
  nihui a48bf43ef7 test conv/fc int8 with activation 5 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  nihui 3c92a1184b
arm neon optimization for general convolution im2col sgemm (#2668) 5 years ago
  nihui ab56083ca5
arm neon optimization for conv3x3s1 winograd42 (#2664) 5 years ago
  nihuini f437bcdd4c enable fp16s and int8s on newer adreno/mali, actually enable int8 tests 5 years ago
  tpoisonooo baf49574c4
innerproduct aarch64 use gemm (#2521) 5 years ago
  nihuini 440db2c8fc conv1x1 pack4 arm fp16sa 5 years ago
  nihuini d17c26e925 conv1x1s1 pack4to8 pack8to4 arm fp16sa 5 years ago
  nihui db5f05c6f0 conv1x1s1 conv3x3s1 winograd pack8to1 arm fp16sa 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 3ff40b0679
Ci rv32imc (#1940) 5 years ago
  nihuini 0d6cc01d55 innerproduct handle mish activation, fix naive C testing, fix #1930 5 years ago
  Tijmen Verhulsdonck 3325cf94f8
Added AVX swish/lrn/batchnorm (#1897) 5 years ago
  Tijmen Verhulsdonck 26999fab19
Fix AVX wino 3x3 and improve convolution test converage (#1891) 5 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  nihui 15a4b2c878 test pad same mode 6 years ago
  zhiliu6 3bfabf1d6a
Add fused convolution and mish layer support. (#1761) 6 years ago
  nihuini b2d9325c0d test activation fusion 6 years ago
  nihuini 956ab49d02 fix conv1x1s1 pack4to1 bf16s 6 years ago
  nihuini 36f6942fa0 testing time is too long ... 6 years ago
  nihui 979dd5fd11 test does not need to provide data type options 6 years ago
  nihui 839c4c4e34 smaller test size 6 years ago
  nihui 0f7e7bca02
shader shape specialization constant and basic local group size partition (#1523) 6 years ago
  nihui e050d596b0 fix convolution int8 requant test on x86 6 years ago
  nihuini f813222c1a template candy 6 years ago
  tpoisonooo 7168829f06 Fix int8 requant (#1499) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihui 038666e049
the initial auto test (#1464) 6 years ago