57 Commits (d74f1e5654da4e405c10f5e1c2dacc68239dbd2f)

Author SHA1 Message Date
  nihui 171b9d1bba use spdx license header, copyright Tencent (#6152) 11 months ago
  nihui 24a3b99f1f drop layer support_image_storage and option use_image_storage (#6126) 1 year ago
  nihui abf0de4488 update ruapu to detect zfh zvfh xtheadvector (#5841) 1 year ago
  nihui 211e238639 drop layer forward vkimagemat (#6124) 1 year ago
  nihui a8e4db713b initialize layer featmask zero (#6078) 1 year ago
  nihui 19caca3140 port rvv intrinsic 1.0+ (#5642) 1 year ago
  nihui 056509a034 fix create_pipeline crash in vulkan-enabled layer without calling load_param/load_model first (#5410) 2 years ago
  nihui 556b79ce4d create layer decoupled (#5258) 2 years ago
  邓实诚 a1e3ebf8e5 implement simplemath (#4905) 2 years ago
  nihui 6c21b08727 check loongarch lasx and enable (#4820) 3 years ago
  junchao-loongson 279222c2c9 add vector optimization for loongarch64 (#4242) 3 years ago
  nihui 7886e90c65 split arm82 source for smaller binary and memory footprint (#3877) 4 years ago
  nihui 72c467d1d9 mips msa optimization for quantize dequantize requantize (#3672) 4 years ago
  nihui 920aa79f04 drop x86 avx2 fp16 (#3568) 4 years ago
  nihui 4654030541 decouple x86 fma avx2 (#3560) 4 years ago
  nihui 930c36ebe2 avx512 infrastructure (#3407) 4 years ago
  nihui 878cb713d5 optional arm82 dot source (#3415) 4 years ago
  nihuini 11794675f3 apple a11 and a12 do not support armv8.2 dotprod, restore the fp16-only optimized path 4 years ago
  nihuini affbefe311 some space cleanup, blob clone from allocator 4 years ago
  Tijmen Verhulsdonck eaa7e24db6 Added ability to switch AVX/AVX2 during runtime (#3076) 4 years ago
  nihui 1c31ac2549 runtime cpu dispatch for mips msa and loongson mmi 5 years ago
  nihui 2f70343aec cmake clean (#3032) 5 years ago
  nihui bcbb55f033 apple device always has armv8.2 dot (#2963) 5 years ago
  nihuini afc02d57f9 runtime detect armv8.2 dotprod 5 years ago
  nihui 11958424c2 runtime riscv v and zfh dispatch, riscv v optimization for cast 5 years ago
  nihui 5fe75f19ef architecture changes for int8 packing (#2771) 5 years ago
  nihui 54c0a13b9f build shared library (#2525) 5 years ago
  nihui 1040f40c8b update c api for custom allocator datareader modelbin and layer registration, add cookie userdata to layer 5 years ago
  Cai Shanli a9df4f6c59 add custom layer destroyer (#2481) 5 years ago
  Leo 5afd318b86 Support remove libstdc++ denpendency (#2030) 5 years ago
  nihui 54e79a62d7 fix crash on non-arm82 build 5 years ago
  nihui bb5bfe3841 avx2 infrastructure (#1943) 5 years ago
  nihui 11cffce114 armv8.2 infrastructure (#1856) 5 years ago
  nihui 3ef995ed1e format code style and setup restyled.io (#1840) 6 years ago
  Naiyang Lin ceef2470a5 Add logger.h (#1753) 6 years ago
  nihui 62da1228e1 adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihui 7365bb80a2 vkmat and command api breaks (#1689) 6 years ago
  nihui 7d1eec3d5d the use_bf16_storage option 6 years ago
  nihuini 6935b78926 new layer attribute support_packing 7 years ago
  Howave 123ca35e00 fix compile warnings (#1042) 7 years ago
  nihuini e09607bc22 add option to upload model function, pipeline creation honors option use flags, setting allocator per extractor do not make much sense 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  nihuini 2672cd437f add layer type index member 7 years ago
  nihuini 85a28959e4 fix binaryop shader binding, use shared buffer state, fix blob copy in non-light mode, fix #817 7 years ago
  nihui b49cb56ad9 constify vulkan device handle, use default local vulkan device if not specified 7 years ago
  nihui f0b4933eac massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc [WIP] vulkan compute (#618) 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago