42 Commits (db035d602de6ec0cd3bdd191cb21f4b73e7599be)

Author SHA1 Message Date
  nihui db035d602d
update ncnnoptimize layers, lightmode=false keeps original weight (#5414) 2 years ago
  nihui 556b79ce4d
create layer decoupled (#5258) 2 years ago
  nihui 4494aadd74
deconvolution dynamic weight (#5119) 2 years ago
  zhiliu6 125b9f2baf
reduce double usage (#4671) 3 years ago
  nihui a961ab992e
arm deconv matmul use gemm (#4594) 3 years ago
  nihui c471826da1
fix arm bfloat2float float2bfloat oops (#4439) 3 years ago
  nihui dd86cebab8
armv8.6 ci and coverage (#4025) 3 years ago
  nihui 7886e90c65
split arm82 source for smaller binary and memory footprint (#3877) 4 years ago
  nihui 241524ffce
discard weight memory for x86 arm vulkan (#3865) 4 years ago
  nihui 3f2799d706
always build tightly packed weight, fix #3545 (#3547) 4 years ago
  nihui c0a94cd9ca
fix armv7 without neon (#3514) 4 years ago
  nihui adfc8b25bc
fix deconv output pad (#3337) 4 years ago
  nihui cdf45a6512
cmake option NCNN_BF16 (#3068) 4 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  nihui 6f26f57d9c deconv4x4s2 arm fp16sa 5 years ago
  nihui bf09af21be exp arm fp16sa neon optimization 5 years ago
  nihui 72a27d4776 utility wrapper for neon float32 bfloat16 conversion, deconvolution deconvolutiondepthwise arm fp16s fp16sa bf16s 5 years ago
  nihui b5e288b521 layer creator function is not necessary for built-in layers 5 years ago
  nihui 01b8b79ed2 packing layout option respect support_packing property 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  Naiyang Lin ceef2470a5
Add logger.h (#1753) 6 years ago
  nihui 57bedd59fa fix build without neon 6 years ago
  nihui 038666e049
the initial auto test (#1464) 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihui 48e3e7d49c move neon activation into a wrapper function 6 years ago
  nihuini b7085ceec0 deconvolution apply output adj first, then crop the padding 6 years ago
  nihuini 296e0022df deconvolution output adj and output shape 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui 394f6786b9 neon enable support_packing 6 years ago
  nihui cf42e7c254 deconvolutiondepthwise pack4 arm neon 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  BUG1989 d9f269fa3d use sgemm fp32 on arm platform,optimize conv1x1s2 (#1031) 7 years ago
  nihuini 4de4078779 move platform includes out of namespace 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini c6e075cef7 fuse deconv/innerproduct relu arm 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihuini 76a55693a6 decouple convolutiondepthwise and convolution, reduce binary size by 10%, fix #254 8 years ago
  nihui bdb70a2010 padding w h in convolution and deconvolution 8 years ago
  nihui 44b4519307 non-square convolution and deconvolution kernel stride dilation 8 years ago
  tedder59 4d59d0afda Add depthwise Deconvolution. (#187) 8 years ago
  nihuini 531582abf9 arm neon optimize for deconvolution 3x3, first try 8 years ago