1966 Commits (991fca81ed63d2b76c1d40c05d772cb8f5bcfe7f)

Author SHA1 Message Date
  KRT 1b3bb3fa96 Fix popcount64 linking issue and improve compatibility 11 months ago
  KRT a356f6ef3f Fix aarch64-native simplestl-simplemath compilation 11 months ago
  KRT b6779e8d11 Fix compilation and test issues 11 months ago
  KRT 6105437380 Fix Windows ARM compatibility for popcount64 function 11 months ago
  yok7 cc76653c2a apply code-format changes 11 months ago
  KRT 252f30680e Fix NCNN_SIMPLESTL compatibility and improve bit shift safety 11 months ago
  KRT 58595848bd Add missing <utility> header for std::pair usage 11 months ago
  yok7 b18ec231fb apply code-format changes 11 months ago
  KRT e38e779f40 Fix uint64_t compilation errors and implement >64 CPU support 11 months ago
  KRT c475868e26 Support for >64 CPU systems 11 months ago
  nihui 8c5e23b625 use size_t type for proper large tensor support (#6201) 11 months ago
  Copilot 4644540ea4 Add Windows XP support merging PRs #6176 and #6177 (#6204) 11 months ago
  nihui fe509e9bc1 flexible coopmat mnk and unified elempack for vulkan deconvolution gemm (#6199) 11 months ago
  nihui 6c44404d34 use pre-downloaded codecov cli binary (#6198) 11 months ago
  nihui 8e69ddf707 flexible coopmat mnk and unified elempack for vulkan convolution winograd gemm (#6196) 11 months ago
  nihui 0c3dad7656 flexible coopmat mnk and unified elempack for vulkan convolution gemm (#6192) 11 months ago
  nihui d395000edc flexible coopmat mnk and unified elempack for vulkan convolution 1x1s1d1 (#6154) 11 months ago
  nihui a1f5d5be47 fix unaryop on moltenvk (#6181) 11 months ago
  韦康琦 4f9f7d929e vulkan unaryop unified elempack shader (#6179) 11 months ago
  Yexuan Wu e2d93a482e Unified elempack activation function vulkan shader (#6175) 11 months ago
  Yexuan Wu 11b6990a9e vulkan sigmoid unified elempack shader (#6170) 11 months ago
  Willaaaaaaa 04341120d4 feat: add cmake print ver support (#6165) 11 months ago
  Yexuan Wu 7abf84eb74 Fix win32 GetLogicalProcessorInformationEx API (#6169) 11 months ago
  nihui 0cfe201b3c fix vulkan absval fp16 (#6167) 1 year ago
  AtomAlpaca f2970319d3 loongarch LASX optimization for math/trigonometric functions and sigmoid (#6163) 1 year ago
  chri321 4c72f52954 docs: update Chinese glsl-extension documentation (#6162) 1 year ago
  nihui 171b9d1bba use spdx license header, copyright Tencent (#6152) 1 year ago
  nihui 075d07ede2 compute-only vulkan (#6131) 1 year ago
  zhuzeitou 10912b3b58 Use platform-specific APIs for environment variables (#6147) 1 year ago
  nihui 9f832c19c1 vulkan int8 packing quantize dequantize requantize (#3731) 1 year ago
  nihui 7557f5c208 vulkan absval unified elempack shader (#6132) 1 year ago
  nihui 626d9d0910 vulkan packing code clean, drop image storage type, unified fp16p fp16s packing (#6128) 1 year ago
  nihui bd0b111775 vulkan tight fp16p pack1 (#6127) 1 year ago
  nihui 24a3b99f1f drop layer support_image_storage and option use_image_storage (#6126) 1 year ago
  nihui f168962a74 update glslang for fp8, fix fp8 enum (#6125) 1 year ago
  nihui abf0de4488 update ruapu to detect zfh zvfh xtheadvector (#5841) 1 year ago
  nihui 211e238639 drop layer forward vkimagemat (#6124) 1 year ago
  Yexuan Wu 1cd5373483 instancenorm x86 simd optimization (#6097) 1 year ago
  nihui cc40332804 discover VK_KHR_vulkan_memory_model (#6121) 1 year ago
  nihui 6e3052a88d print gpu matrix property info (#6114) 1 year ago
  nihui 8998a13d06 discover VK_EXT_shader_float8 (#6120) 1 year ago
  nihui ef5e79d80e add missing macros for VK_NV_cooperative_vector and VK_NV_cooperative_matrix2 (#6119) 1 year ago
  nihui 12f57fb3d1 discover VK_NV_cooperative_matrix2 (#6118) 1 year ago
  nihui 510b461e9a discover VK_NV_cooperative_vector (#6117) 1 year ago
  nihui 9cdc02bb7a unified vulkan khr/nv cooperative matrix shader (#6116) 1 year ago
  nihui b9f98f0d3a always allocate aligned size for 1d/2d mat and vkmat (#6104) 1 year ago
  nihui 8a2eab1114 set localsize as multiple of subgroup size (#2483) 1 year ago
  nihui 87e8b5f4c1 use combine_x for sse/avx vector combination (#6113) 1 year ago
  nihui 23b64c9cf9 fix vulkan validation error, do not enforce local_size_x be multiple of subgroup size (#6111) 1 year ago
  nihui 4c4ecdf118 dequantize pack8 for all datatypes, fix convdw int8 dequant pack8 (#6109) 1 year ago