163 Commits (d38871bbfc38048d904efe50099bc2b1b7901bc1)

Author SHA1 Message Date
  Cai Shanli a9df4f6c59
add custom layer destroyer (#2481) 5 years ago
  Martin Han b441f738bd
Extract on CPU without pack/fp16fp32 (#2288) 5 years ago
  PENGUINLIONG 8f8f2de4d0
SSE2 optimization pack (#2123) 5 years ago
  nihui cf3cf83cd3
unified image shader storage type (#2231) 5 years ago
  nihuini b766c8cd9e fix potential divide by zero fault when bf16s / fp16s enabled, fix #2125 5 years ago
  nihuini a334513b5e fp16a option fix 5 years ago
  nihuini e841ae73c6 fix arm fp16s feat output, fix #2003 5 years ago
  nihui 54e79a62d7 fix crash on non-arm82 build 5 years ago
  nihui c173d51c9b mish sigmoid swish tanh arm fp16s 5 years ago
  nihui 71f86af8a6 fix non-arm82 ci 5 years ago
  nihui 9a2e2a6937 convert fp32 blobs for layers with fp16 storage support 5 years ago
  nihui 308145254e mask bf16 option in layer forward, disable gpu when bf16 enabled, fix #1962 5 years ago
  nihui 71dc13625f disable bf16 storage for int8 inference 5 years ago
  nihuini 4e4f0baa73 set openmp blocktime 20 for reducing power consumption, blocktime option 5 years ago
  nihui bb5bfe3841
avx2 infrastructure (#1943) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 164273de61
online pipeline cache (#1792) 5 years ago
  Tijmen Verhulsdonck d1b5711791
X86 Elempack 8 AVX implementations. (#1853) 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  nihuini 64985809a3 fix crash in load_model when gpu is not used 6 years ago
  nihuini 116869594c fix cpu-only build 6 years ago
  nihuini aeba24b371 enable implicit fp16a on arm mali variants, add bug tag for layout binding id alias 6 years ago
  nihuini 054ec09195 adreno device blacklist 6 years ago
  nihuini c94d1b39ad force diable image storage on macos and ios, fix #1738 6 years ago
  Naiyang Lin ceef2470a5
Add logger.h (#1753) 6 years ago
  nihui 9a9a618229 image storage is mandatory, less options makes life easier 6 years ago
  nihuini 041437ef48 seperate packing cast type to more shaders 6 years ago
  nihui 62da1228e1
adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihui 7365bb80a2
vkmat and command api breaks (#1689) 6 years ago
  nihui 7d1eec3d5d the use_bf16_storage option 6 years ago
  xieydd b760e22da2
fix requant relu6 bug (#1590) 6 years ago
  nihui 52ce59e672 fix build with requant option on 6 years ago
  nihui 0f7e7bca02
shader shape specialization constant and basic local group size partition (#1523) 6 years ago
  nihui e2bd4eae6e write shape as 4-number tuple 6 years ago
  nihui 6cefaad957 ncnnoptimize shape inference, load shape hint 6 years ago
  nihui a718129d76 shader pack8 option works 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  Anton Kochkov 07170542c9 Fix GCC 9.x warnings (#1462) 6 years ago
  Sungmann Cho 9bfc554bc9 Fix warnings on Visual Studio (#1431) 6 years ago
  nihuini 3c9b3074e4 reclaim local vulkan allocator after blob_mats_gpu clear, fix random crash in multithread gpu inference without explicit per-thread allocator set 6 years ago
  nihuini 50e8b5e4e8 multiple transfers may run concurrently if there is no dependency with each other, do not share staging buffer memory to fix potential data race 6 years ago
  nihuini 33956cbfc3 pretty error info 6 years ago
  nihuini a170ef1acf remove the default option usage in layer interface, fix write out of range in cast arm pack4, handle fp16p conversion on cpu/gpu transfer 6 years ago
  nihuini e73b06bbb8 fix build with NCNN_STRING=OFF 6 years ago
  nihuini 64333429bb data reader wrapper, fix #1325 6 years ago
  nihui 8c1b87b1a2 fallback to cpu if no vulkan device found 6 years ago
  Natsu 637d96c1d2 Fix gcc 9 compilation failure (#1189) 6 years ago
  nihui ff62e7eed9 use_packing_layout option works 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  nihui 8c53706987 net vkdev getter api 7 years ago