112 Commits (db035d602de6ec0cd3bdd191cb21f4b73e7599be)

Author SHA1 Message Date
  nihui db035d602d
update ncnnoptimize layers, lightmode=false keeps original weight (#5414) 2 years ago
  Sophon 294e786d36
convolution_x86: Fix typo in logging (#5310) 2 years ago
  nihui 556b79ce4d
create layer decoupled (#5258) 2 years ago
  nihui 4494aadd74
deconvolution dynamic weight (#5119) 2 years ago
  nihui 7b02425246
x86 optimization for convolution int8 winograd unified elempack (#5054) 2 years ago
  nihui 9ecf6a61be
x86 optimization for convolution int8 gemm unified elempack (#4881) 2 years ago
  nihui 55709708e9
x86 optimization for convolution int8 packed unified elempack (#4861) 2 years ago
  nihui 2e3e680d77
x86 optimization for packed convolution unified elempack (#4469) 3 years ago
  nihui bd5bbe3f2c
x86 optimization for winograd unified elempack part2 (#4470) 3 years ago
  nihui 88274827da
x86 optimization for winograd unified elempack (#4456) 3 years ago
  nihui 1f1981052c
convolution deconvolution and deformableconv2d x86 use sgemm (#4414) 3 years ago
  nihui 8eab5ea0ea
x86 sse2/avx2 optimization for convolution sgemm/winograd int8 family (#4286) 3 years ago
  nihui 20a14bf5ae
arm convolution winograd dot function, adjust arm convolution winograd strategy (#3915) 3 years ago
  nihui ca0ba4b25f
fine grained winograd options, adjust x86 convolution winograd strategy (#3908) 3 years ago
  nihui 241524ffce
discard weight memory for x86 arm vulkan (#3865) 4 years ago
  nihui 02a7e64e18
optimize x86 winograd input transform transpose (#3818) 4 years ago
  nihui bf64d8f1ec
fix winograd function name (#3820) 4 years ago
  nihui 131f3d1323
x86 avx512 optimization for convolution winograd pack16to1 and deconvolution family, increase simpleomp argv count (#3694) 4 years ago
  nihui 3d169b3237
x86 avx512 optimization (#3691) 4 years ago
  nihui 9298d05e86
split convolution winograd transform input output (#3688) 4 years ago
  nihui dadc640c66
x86 avx512 optimization (#3581) 4 years ago
  nihui 920aa79f04
drop x86 avx2 fp16 (#3568) 4 years ago
  nihuini 57a7101fc6
fix ci, second try 4 years ago
  nihuini cfedcfdc57
fix ci, first try 4 years ago
  nihui 3f2799d706
always build tightly packed weight, fix #3545 (#3547) 4 years ago
  nihui 139554b36e
rewrite convolution x86 sgemm pack1 (#3544) 4 years ago
  nihui fb6283c8b0
x86 avx fma optimization (#3543) 4 years ago
  nihui de77b669c4
x86 sse2 optimization for conv1x1/3x3 pack4 and general sgemm pack4/pack4to1 (#3538) 4 years ago
  nihui d95213a005
x86 convolution int8 optimization third stage (#3506) 4 years ago
  nihui c2896bcd4d
x86 convolution int8 optimization second stage (#3495) 4 years ago
  nihui e9b8f0a6ef
x86 avx2 optimization for convolution gemm int8 (#3489) 4 years ago
  nihui 6941ec8fc9
arm neon optimization for general packed convolution (#3426) 4 years ago
  nihui 999e640d43
dynamic convolution weight (#3408) 4 years ago
  nihui 24fbb6e8cb
honor thread setting on load and vulkan command, ci avx512 t4 (#3391) 4 years ago
  Tijmen Verhulsdonck ac5dc23ccc
added a number of optimized sse layers (#3302) 4 years ago
  zhiliu6 a08f700775
Optimize avx convolution activation (#3299) 4 years ago
  zhiliu6 814f89ef1a
Fuse HardSwish activation into Convolution and InnerProduct (#3233) 4 years ago
  Tijmen Verhulsdonck 4270b5c502
Fix broken codepaths with AVX only (#3254) 4 years ago
  Tijmen Verhulsdonck eaa7e24db6
Added ability to switch AVX/AVX2 during runtime (#3076) 4 years ago
  Evgeny Proydakov 9245cdca42
Fixed compile warnings for clang compiler on MacOS. [-Wunused-parameter] (#2998) 5 years ago
  nihuini 687cc857b1 some x86 sse2 optimization for convolution int8 5 years ago
  nihui 7e1aaa5828
cmake option NCNN_INT8 (#2839) 5 years ago
  nihui 1ea8bfbd2e x86 avx2 conv3x3s1 pack8 direct optimization, fix #2789 5 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  zhiliu6 57397c418d
Optimize general AVX2 convolution. (#2714) 5 years ago
  nihui 82c4acc187 conv1x1s1 and packing pack4 x86 optimization, fix #2510 fix #2509 5 years ago
  Zhuo Zhang f13035794a
fix convolution_x86*.cpp-shadowed-variables-warning (#2444) 5 years ago
  nihuini 1a3191e245 fix libncnn build with gcc-4.8 and gcc-4.4, fix #2388 5 years ago
  zhiliu6 25b224479c
optimize left over x86 convolution (#2378) 5 years ago
  nihui a071637064
optional sse2 (#2373) 5 years ago