108 Commits (254eb8d0d4b505ff81fa782aae33893e23b08f0a)

Author SHA1 Message Date
  nihui 254eb8d0d4
blacklist fp16a on old adreno driver (#4587) 3 years ago
  weirdseed 503a8b921f
fix uninitialized gpu bug_buffer_image_load_zero value (#4493) 3 years ago
  ws 643285a08c
fix macos vulkan instance create failed when vulkan sdk version >= 1.… (#4472) 3 years ago
  nihui c16cac2678
update glslang, fix system glslang include path (#3819) 4 years ago
  nihui 50fa6d39c0 enable fp16a for mali t760 v2 4 years ago
  nihui 7600270430
create uop in spirv-1 mode for vulkan 1.0 compatibility (#3721) 4 years ago
  nihui 9826f3dbf8
shader include vulkan activation, workaround for moltenvk tanh half4 issue (#3711) 4 years ago
  nihui 559e5b23f9
vulkan tensorcore optimization (#3628) 4 years ago
  nihui 3ddd65e18c
massive vulkan optimization part3 (#3632) 4 years ago
  nihui cfcb1cffa9
massive vulkan optimization part2 (#3621) 4 years ago
  nihui 8f25ba0cab enable fp16a on mali-g31 4 years ago
  nihui 30e106b185
add another mali g52 device id 4 years ago
  nihui 5f62fdec87 allow more concurrent gpu submits on device with low queue count 5 years ago
  nihui 81be8e235c
workaround macos intel dummy image readonly issue, fix #2548 (#2864) 5 years ago
  nihui 9fd4d371ae
bridge image for adreno image upload and download (#2658) 5 years ago
  nihuini 3bf03379d7 fix pipeline compilation error on image store fp16sa 5 years ago
  nihuini f437bcdd4c enable fp16s and int8s on newer adreno/mali, actually enable int8 tests 5 years ago
  nihui 80499bd64a enable VK_LAYER_KHRONOS_validation layer in modern vulkan sdk 5 years ago
  nihuini 9b949d65b3 fuse onnx lstm, codeformat exclude pybind11, fix #2562 5 years ago
  nihui 54c0a13b9f
build shared library (#2525) 5 years ago
  nihuini 5650b77054 fix gpu extension conditions 5 years ago
  nihui 1f44e5c6a3
enable ios arm64e (#2475) 5 years ago
  nihui 2b0b2fa388 enable more vulkan extensions, set subgroup size per vendor 5 years ago
  nihui cf3cf83cd3
unified image shader storage type (#2231) 5 years ago
  nihui 9be3f074a9
ci ndk-r16b (#2104) 5 years ago
  nihui b9296c259d
bring up vulkan 1.1 (#2191) 5 years ago
  nihui 4463c3b455 disable image shader on adreno until a better workaround figured out 5 years ago
  youzainn 1c5af3d83c
add device_name field for class GpuInfo (#2122) 5 years ago
  nihuini a334513b5e fp16a option fix 5 years ago
  nihuini 9047741129 always disable fp16/int8 arithmetic for gpu uop 5 years ago
  nihui 9f5b660483 compile spirv 5 years ago
  Leo 5afd318b86
Support remove libstdc++ denpendency (#2030) 5 years ago
  nihui 1322ae40cb
update engine version 5 years ago
  nihuini bf279dcf17 workaround corrupted pipeline cache on old qcom adreno 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 193e08e834 lazy initialize utility operator, fix #1923 5 years ago
  nihui 27e099961c fix double gpu instance destruction 5 years ago
  nihui 164273de61
online pipeline cache (#1792) 5 years ago
  nihuini d2bf77cd88 create new allocator when pre-allocated allocators exhausted, fix #1862 5 years ago
  nihuini c38d304369 the implicit gpu instance makes life easier :) 6 years ago
  nihuini 187a3e672d implicit gpu instance destruction, fix #1849 6 years ago
  nihuini 9bb06e46cf implicit gpu instance creation, fix #1849 6 years ago
  nihuini fd7d87e098 allow linking with external glslang 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  nihuini 554890cda8 fp16p and fp16s cannot be both enabled in shader source 6 years ago
  nihuini 1a3a99d7c9 old qcom driver cannot handle binding id alias 6 years ago
  nihuini f87f21779f resolve cast from type properly, no more fp16p to/from fp16s conversion 6 years ago
  nihuini bb56b5439f fix vkmat download on integrated gpu, workaround priorbox fp16s with online spirv, fix #1700 fix #1805 6 years ago
  nihui 8fec0038ba fix ci test 6 years ago
  nihuini aeba24b371 enable implicit fp16a on arm mali variants, add bug tag for layout binding id alias 6 years ago