93 Commits (90e6be457b8094fcb63d219691cdf0c41fe01fc0)

Author SHA1 Message Date
  nihui 90e6be457b conv1x1s1 bf16s neon kernel 6 years ago
  nihuini 1984cad0e1 conv5x5s2 bf16s neon kernel 6 years ago
  nihuini 17577775ae conv5x5s1 bf16s neon kernel 6 years ago
  nihui 6561334c5f conv7x7s2 pack1to4 bf16s neon kernel 6 years ago
  nihui b7c82fcc45 code clean, concat bf16s 6 years ago
  nihui c3f966f7e7 conv3x3s1 pack4to1 bf16s neon kernel 6 years ago
  nihuini c6ebd13afb conv1x1 pack4to1 bf16s neon kernel 6 years ago
  nihui 7d1eec3d5d the use_bf16_storage option 6 years ago
  nihui c819b4d839 fix build without openmp 6 years ago
  nihui e14716dfef convolution and pooling make padding helper, flatten innerproduct pooling bf16s neon 6 years ago
  nihui 57bedd59fa fix build without neon 6 years ago
  nihui 719d9f48ae im2col wrt sgemm convolution option 6 years ago
  nihui 25eb060b7c fix potential crash on int8 convolution with no bias 6 years ago
  tpoisonooo 2e51b026ce fix some boring compile warnings (#1510) 6 years ago
  tpoisonooo 7168829f06 Fix int8 requant (#1499) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihui 038666e049 the initial auto test (#1464) 6 years ago
  tpoisonooo d702052449 Add assembly int8 gemm (#1307) 6 years ago
  nihuini 336d1c1edd remove the ncnn namespace for in source Option 6 years ago
  nihuini 567e2bd501 a dirty hack for resolving int8 pack4 crash 6 years ago
  nihuini 65ce6bccfd faster weight transform for optimized kernel 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihuini 19d75955d6 arm neon assembly optimization for conv3x3s1 winograd pack4to1 6 years ago
  nihuini e63e2449fd arm neon assembly optimization for conv7x7s2 pack1to4 6 years ago
  nihui 56fd26a2da arm neon assembly optimization for conv1x1s1 pack4to1 6 years ago
  nihuini 15e86dc8e9 reduce pack4 weight memory usage for specialized kernel, reduce runtime memory usage in conv3x3s1 winograd 6 years ago
  nihuini c5f1dc3fe4 arm neon assembly optimization for conv3x3s1 pack4to1 6 years ago
  nihui e0f6e3f669 pre-interleave 8-channel weight data on aarch64, conv1x1s1 version 6 years ago
  nihui 7173b6e38e arm neon assembly optimization for conv3x3s2 pack4 6 years ago
  nihuini cf0c49dd71 arm neon assembly optimization for conv5x5s1 pack4 and conv5x5s2 pack4 6 years ago
  nihui 9e529354fb arm neon optimization for conv1x1s2 pack4 6 years ago
  nihui 48e3e7d49c move neon activation into a wrapper function 6 years ago
  nihui e19b7097df arm neon assembly optimization for conv3x3s1 pack1to4 6 years ago
  nihui 3a452f734a arm neon assembly optimization for conv3x3s2 pack1to4 6 years ago
  nihui 6edd42f566 arm neon assembly for conv1x1s1 and conv3x3s1 winograd pack4 6 years ago
  nihuini c0a4ffcf66 convolution pad_value param 6 years ago
  nihui 4dc98ffaab conv1x1s1 and conv3x3s1 winograd pack4 neon optimization, first try 6 years ago
  nihuini 296e0022df deconvolution output adj and output shape 6 years ago
  nihuini e4b44d293e more autopad SAME_LOWER 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui 394f6786b9 neon enable support_packing 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  tpoisonooo 1a0459cffe Update convolution_arm.cpp 6 years ago
  nihuini c4f23ae8ad rename Mat packing to elempack 6 years ago
  nihui 7655b9e4e9 fix build on armv7 again ... 6 years ago
  nihui a97439988f fix build on armv7 6 years ago
  nihuini 81a5dfe76b general convolution and convolutiondepthwise arm neon pack4, wip 6 years ago
  tpoisonooo 1ca4387c9c Auto choose conv implementation (#1085) 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  BUG1989 d9f269fa3d use sgemm fp32 on arm platform,optimize conv1x1s2 (#1031) 7 years ago