168 Commits (41a9df5cd28778779e48cc78deb8468f3e8428da)

Author SHA1 Message Date
  邓实诚 a1e3ebf8e5
implement simplemath (#4905) 2 years ago
  nihui e80fcbca8f
prefer faster and larger device local only memory on amd integrated graphics, heap budget value follows the same strategy as blob allocator (#4936) 2 years ago
  nihui c45c01c7c1
enable VK_KHR_cooperative_matrix (#4823) 2 years ago
  Upliner Mikhalych e8645e9117
Don't silently ignore errors in VkCompute::submit_and_wait (#4828) 2 years ago
  nihui 15cf81c40d
workaround multiheadattention vulkan nan issue on nvidia gpu (#4682) 3 years ago
  nihui 72a3e5141f
fix vulkan validation error, prefer VK_KHR_buffer_device_address over VK_EXT_buffer_device_address (#4680) 3 years ago
  nihui e006aa8007
fix extension not present error (#4655) 3 years ago
  nihui a2106f840f
setup more extension entrypoint (#4636) 3 years ago
  張小凡 d87e895a1f
Add get_gpu_instance() function and Organized the instance class codes. (#4630) 3 years ago
  張小凡 772b13a1d1
Add three extension capability support check (#4626) 3 years ago
  nihui 254eb8d0d4
blacklist fp16a on old adreno driver (#4587) 3 years ago
  weirdseed 503a8b921f
fix uninitialized gpu bug_buffer_image_load_zero value (#4493) 3 years ago
  ws 643285a08c
fix macos vulkan instance create failed when vulkan sdk version >= 1.… (#4472) 3 years ago
  nihui c16cac2678
update glslang, fix system glslang include path (#3819) 4 years ago
  nihui 50fa6d39c0 enable fp16a for mali t760 v2 4 years ago
  nihui 7600270430
create uop in spirv-1 mode for vulkan 1.0 compatibility (#3721) 4 years ago
  nihui 9826f3dbf8
shader include vulkan activation, workaround for moltenvk tanh half4 issue (#3711) 4 years ago
  nihui 559e5b23f9
vulkan tensorcore optimization (#3628) 4 years ago
  nihui 3ddd65e18c
massive vulkan optimization part3 (#3632) 4 years ago
  nihui cfcb1cffa9
massive vulkan optimization part2 (#3621) 4 years ago
  nihui 8f25ba0cab enable fp16a on mali-g31 4 years ago
  nihui 30e106b185
add another mali g52 device id 4 years ago
  nihui 5f62fdec87 allow more concurrent gpu submits on device with low queue count 5 years ago
  nihui 81be8e235c
workaround macos intel dummy image readonly issue, fix #2548 (#2864) 5 years ago
  nihui 9fd4d371ae
bridge image for adreno image upload and download (#2658) 5 years ago
  nihuini 3bf03379d7 fix pipeline compilation error on image store fp16sa 5 years ago
  nihuini f437bcdd4c enable fp16s and int8s on newer adreno/mali, actually enable int8 tests 5 years ago
  nihui 80499bd64a enable VK_LAYER_KHRONOS_validation layer in modern vulkan sdk 5 years ago
  nihuini 9b949d65b3 fuse onnx lstm, codeformat exclude pybind11, fix #2562 5 years ago
  nihui 54c0a13b9f
build shared library (#2525) 5 years ago
  nihuini 5650b77054 fix gpu extension conditions 5 years ago
  nihui 1f44e5c6a3
enable ios arm64e (#2475) 5 years ago
  nihui 2b0b2fa388 enable more vulkan extensions, set subgroup size per vendor 5 years ago
  nihui cf3cf83cd3
unified image shader storage type (#2231) 5 years ago
  nihui 9be3f074a9
ci ndk-r16b (#2104) 5 years ago
  nihui b9296c259d
bring up vulkan 1.1 (#2191) 5 years ago
  nihui 4463c3b455 disable image shader on adreno until a better workaround figured out 5 years ago
  youzainn 1c5af3d83c
add device_name field for class GpuInfo (#2122) 5 years ago
  nihuini a334513b5e fp16a option fix 5 years ago
  nihuini 9047741129 always disable fp16/int8 arithmetic for gpu uop 5 years ago
  nihui 9f5b660483 compile spirv 5 years ago
  Leo 5afd318b86
Support remove libstdc++ denpendency (#2030) 5 years ago
  nihui 1322ae40cb
update engine version 5 years ago
  nihuini bf279dcf17 workaround corrupted pipeline cache on old qcom adreno 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 193e08e834 lazy initialize utility operator, fix #1923 5 years ago
  nihui 27e099961c fix double gpu instance destruction 5 years ago
  nihui 164273de61
online pipeline cache (#1792) 5 years ago
  nihuini d2bf77cd88 create new allocator when pre-allocated allocators exhausted, fix #1862 5 years ago
  nihuini c38d304369 the implicit gpu instance makes life easier :) 6 years ago