46 Commits (acbaaa665b327a60f529f57f8ef15f6c7ebc8ffe)

Author SHA1 Message Date
  nihui 7886e90c65
split arm82 source for smaller binary and memory footprint (#3877) 4 years ago
  nihui 72c467d1d9
mips msa optimization for quantize dequantize requantize (#3672) 4 years ago
  nihui 920aa79f04
drop x86 avx2 fp16 (#3568) 4 years ago
  nihui 4654030541
decouple x86 fma avx2 (#3560) 4 years ago
  nihui 930c36ebe2
avx512 infrastructure (#3407) 4 years ago
  nihui 878cb713d5
optional arm82 dot source (#3415) 4 years ago
  nihuini 11794675f3
apple a11 and a12 do not support armv8.2 dotprod, restore the fp16-only optimized path 4 years ago
  nihuini affbefe311
some space cleanup, blob clone from allocator 4 years ago
  Tijmen Verhulsdonck eaa7e24db6
Added ability to switch AVX/AVX2 during runtime (#3076) 4 years ago
  nihui 1c31ac2549 runtime cpu dispatch for mips msa and loongson mmi 5 years ago
  nihui 2f70343aec
cmake clean (#3032) 5 years ago
  nihui bcbb55f033
apple device always has armv8.2 dot (#2963) 5 years ago
  nihuini afc02d57f9 runtime detect armv8.2 dotprod 5 years ago
  nihui 11958424c2 runtime riscv v and zfh dispatch, riscv v optimization for cast 5 years ago
  nihui 5fe75f19ef
architecture changes for int8 packing (#2771) 5 years ago
  nihui 54c0a13b9f
build shared library (#2525) 5 years ago
  nihui 1040f40c8b update c api for custom allocator datareader modelbin and layer registration, add cookie userdata to layer 5 years ago
  Cai Shanli a9df4f6c59
add custom layer destroyer (#2481) 5 years ago
  Leo 5afd318b86
Support remove libstdc++ denpendency (#2030) 5 years ago
  nihui 54e79a62d7 fix crash on non-arm82 build 5 years ago
  nihui bb5bfe3841
avx2 infrastructure (#1943) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  Naiyang Lin ceef2470a5
Add logger.h (#1753) 6 years ago
  nihui 62da1228e1
adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihui 7365bb80a2
vkmat and command api breaks (#1689) 6 years ago
  nihui 7d1eec3d5d the use_bf16_storage option 6 years ago
  nihuini 6935b78926 new layer attribute support_packing 7 years ago
  Howave 123ca35e00 fix compile warnings (#1042) 7 years ago
  nihuini e09607bc22 add option to upload model function, pipeline creation honors option use flags, setting allocator per extractor do not make much sense 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  nihuini 2672cd437f add layer type index member 7 years ago
  nihuini 85a28959e4 fix binaryop shader binding, use shared buffer state, fix blob copy in non-light mode, fix #817 7 years ago
  nihui b49cb56ad9 constify vulkan device handle, use default local vulkan device if not specified 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
  nihui 10b86c2af5 create layer from type name 8 years ago
  nihui a181d25098 new model load api, fix #215 8 years ago
  nihuini a4d28107f4 check clone empty 8 years ago
  nihui 15ad4dfb9f forward reuse forward_inplace routine, reduce binary size with little memcpy overhead in non-light mode 8 years ago
  nihui 1e2265dd99 new param load api 8 years ago
  nihui 87bad3420e improve custom layer register 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago