3544 Commits (a87d2d2891c0c448495c1ca614dcd8f6e27bd59b)
 

Author SHA1 Message Date
  nihuini a87d2d2891
bump version 11 months ago
  nihuini 23f23dace4
fix pnnx build on msvc 11 months ago
  nihui b10c9dd9a5
fix vulkan structure definition conflict with VK_KHR_acceleration_structure (#6205) 11 months ago
  nihui 8c5e23b625
use size_t type for proper large tensor support (#6201) 11 months ago
  Copilot 4644540ea4
Add Windows XP support merging PRs #6176 and #6177 (#6204) 11 months ago
  nihui fe509e9bc1
flexible coopmat mnk and unified elempack for vulkan deconvolution gemm (#6199) 11 months ago
  nihui 091f7da5ba
update spacemit toolchain and qemu test (#6200) 11 months ago
  nihui 6c44404d34
use pre-downloaded codecov cli binary (#6198) 11 months ago
  nihui 8e69ddf707
flexible coopmat mnk and unified elempack for vulkan convolution winograd gemm (#6196) 11 months ago
  nihui 0c3dad7656
flexible coopmat mnk and unified elempack for vulkan convolution gemm (#6192) 11 months ago
  nihui d395000edc
flexible coopmat mnk and unified elempack for vulkan convolution 1x1s1d1 (#6154) 11 months ago
  nihui a1f5d5be47
fix unaryop on moltenvk (#6181) 11 months ago
  韦康琦 4f9f7d929e
vulkan unaryop unified elempack shader (#6179) 11 months ago
  Yexuan Wu e2d93a482e
Unified elempack activation function vulkan shader (#6175) 11 months ago
  WANG KE 4d8220ce22
Skip int8 model on GPU and merge from upstream (#6174) 11 months ago
  GIBEREZ 44982d0d23
About the update to the GLSL documentation after the image functions are deprecated (#6173) 11 months ago
  Yexuan Wu 11b6990a9e
vulkan sigmoid unified elempack shader (#6170) 11 months ago
  Willaaaaaaa 04341120d4
feat: add cmake print ver support (#6165) 11 months ago
  Yexuan Wu 7abf84eb74
Fix win32 GetLogicalProcessorInformationEx API (#6169) 11 months ago
  nihui 0cfe201b3c
fix vulkan absval fp16 (#6167) 11 months ago
  Christopher 260f493ada
add cross toolchain cmake config of AK3918(AK) and SS928(hisi) (#6164) 11 months ago
  AtomAlpaca f2970319d3
loongarch LASX optimization for math/trigonometric functions and sigmoid (#6163) 1 year ago
  chri321 4c72f52954
docs: update Chinese glsl-extension documentation (#6162) 1 year ago
  dependabot[bot] cae9e636d0
Bump stefanzweifel/git-auto-commit-action from 5 to 6 (#6115) 1 year ago
  nihui 171b9d1bba
use spdx license header, copyright Tencent (#6152) 1 year ago
  nihui 075d07ede2
compute-only vulkan (#6131) 1 year ago
  zhuzeitou 10912b3b58
Use platform-specific APIs for environment variables (#6147) 1 year ago
  nihui 9f832c19c1
vulkan int8 packing quantize dequantize requantize (#3731) 1 year ago
  J. Zow ed6dcd0c81
Fix ci error for update linux-riscv64.yml (#6133) 1 year ago
  nihui 7557f5c208
vulkan absval unified elempack shader (#6132) 1 year ago
  nihui 626d9d0910
vulkan packing code clean, drop image storage type, unified fp16p fp16s packing (#6128) 1 year ago
  nihui bd0b111775
vulkan tight fp16p pack1 (#6127) 1 year ago
  nihui 24a3b99f1f
drop layer support_image_storage and option use_image_storage (#6126) 1 year ago
  nihui f168962a74
update glslang for fp8, fix fp8 enum (#6125) 1 year ago
  nihui abf0de4488
update ruapu to detect zfh zvfh xtheadvector (#5841) 1 year ago
  nihui 211e238639
drop layer forward vkimagemat (#6124) 1 year ago
  Yexuan Wu 1cd5373483
instancenorm x86 simd optimization (#6097) 1 year ago
  nihui cc40332804
discover VK_KHR_vulkan_memory_model (#6121) 1 year ago
  nihui 6e3052a88d
print gpu matrix property info (#6114) 1 year ago
  nihui 8998a13d06
discover VK_EXT_shader_float8 (#6120) 1 year ago
  nihui ef5e79d80e
add missing macros for VK_NV_cooperative_vector and VK_NV_cooperative_matrix2 (#6119) 1 year ago
  nihui 12f57fb3d1
discover VK_NV_cooperative_matrix2 (#6118) 1 year ago
  nihui 510b461e9a
discover VK_NV_cooperative_vector (#6117) 1 year ago
  nihui 9cdc02bb7a
unified vulkan khr/nv cooperative matrix shader (#6116) 1 year ago
  nihui b9f98f0d3a
always allocate aligned size for 1d/2d mat and vkmat (#6104) 1 year ago
  nihui 8a2eab1114
set localsize as multiple of subgroup size (#2483) 1 year ago
  nihui 87e8b5f4c1
use combine_x for sse/avx vector combination (#6113) 1 year ago
  nihui 23b64c9cf9
fix vulkan validation error, do not enforce local_size_x be multiple of subgroup size (#6111) 1 year ago
  nihui 4c4ecdf118
dequantize pack8 for all datatypes, fix convdw int8 dequant pack8 (#6109) 1 year ago
  hanzh 78b2e68728
arm unified elempack optimization for groupnorm (#4080) 1 year ago