119 Commits (6f5a4437d1029daef3e32af37a5350e577ea7858)

Author SHA1 Message Date
  nihuini c38d304369 the implicit gpu instance makes life easier :) 6 years ago
  nihuini 187a3e672d implicit gpu instance destruction, fix #1849 6 years ago
  nihuini 9bb06e46cf implicit gpu instance creation, fix #1849 6 years ago
  nihuini fd7d87e098 allow linking with external glslang 6 years ago
  nihui 3ef995ed1e format code style and setup restyled.io (#1840) 6 years ago
  nihuini 554890cda8 fp16p and fp16s cannot be both enabled in shader source 6 years ago
  nihuini 1a3a99d7c9 old qcom driver cannot handle binding id alias 6 years ago
  nihuini f87f21779f resolve cast from type properly, no more fp16p to/from fp16s conversion 6 years ago
  nihuini bb56b5439f fix vkmat download on integrated gpu, workaround priorbox fp16s with online spirv, fix #1700 fix #1805 6 years ago
  nihui 8fec0038ba fix ci test 6 years ago
  nihuini aeba24b371 enable implicit fp16a on arm mali variants, add bug tag for layout binding id alias 6 years ago
  nihuini 054ec09195 adreno device blacklist 6 years ago
  nihuini 765003a615 fix build with old vulkan sdk 6 years ago
  nihuini 6788384595 query gpu heap budget api 6 years ago
  nihui 17c445480f runtime spir-v compilation with libglslang (#1779) 6 years ago
  nihuini b71f22d074 report adreno info, benchncnn enable image storage on adreno 6 years ago
  nihuini c94d1b39ad force disable image storage on macos and ios, fix #1738 6 years ago
  SunTY 705dd36a31 simplestl is an alternative std vector string implementation (#1762) 6 years ago
  Naiyang Lin ceef2470a5 Add logger.h (#1753) 6 years ago
  nihuini 6682cd1638 image fp16pa, mark some bugihfa todo 6 years ago
  nihuini cefe8d38c3 dynamic image storage support from shape hint 6 years ago
  nihuini 1e4a0752b4 fix interp ci test 6 years ago
  nihui 9a9a618229 image storage is mandatory, less options makes life easier 6 years ago
  nihui e8688b042f fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test 6 years ago
  nihui 62da1228e1 adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihuini 5580da4525 bump engine version 6 years ago
  nihui 7365bb80a2 vkmat and command api breaks (#1689) 6 years ago
  nihuini 1ea9de3bdf create shader pipeline by type index, resolve binding count and push constant count from spirv. since we don't create compound shader module for macos and ios compatibility, it is enough to use fixed main as the shader entry point 6 years ago
  nihui f972bf49d1 enable bugihfa on rk3288 and rk3399 6 years ago
  nihui 7c97142524 old qcom adreno driver seems to have the same bug as mali does 6 years ago
  nihui bbaa4dcce2 compile fp16pa, optimize shader for size, enable implicit fp16 arithmetic for qcom855 and qcom855plus 6 years ago
  nihuini b361b24832 do not enforce coherent memory type, queue transfer after uploading model weight 6 years ago
  nihui 038666e049 the initial auto test (#1464) 6 years ago
  nihuini a477aee0ba print graphics queue info, const++ 6 years ago
  nihui 8a87f0267a workaround local workgroup size specialization constant bug for old arm mali vulkan driver, fix #1424 6 years ago
  nihui a867d96822 dynamic memory type querying, respect memory requirement memory type bits 6 years ago
  nihui 7e68c5e1e9 enable ycbcr conversion feature, get graphics queue 6 years ago
  nihui cb41b00e6e setup VK_KHR_bind_memory2 functions 6 years ago
  nihui b29e8b0e09 check and enable more vulkan extensions 6 years ago
  nihuini 21b5508c96 shared locked vkallocator cannot prevent concurrent accessing during actual gpu inference, use separate vkallocator for each queue 7 years ago
  nihuini e9ffdb5bdd 16bit storage on arm mali is buggy 7 years ago
  nihuini 040a8d2427 set vulkan device by gpu index 7 years ago
  nihuini 5fdffbcaac destroy_gpu_instance is not threadsafe anyway, fix deadlock on exit 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini 7f7bbf12e5 new api for getting the default gpu device 7 years ago
  nihuini cd7559c639 more fix for fp16p, still disabled by default 7 years ago
  nihui 25b9736f82 shader fp16 packed 7 years ago
  nihuini 4b50a97e31 implement vulkan winograd23 7 years ago
  nihuini 37e150162a do not retrieve timestamp availability bits 7 years ago
  nihuini 8e2fb2e710 expose timestamp_period and timestamp_valid_bits 7 years ago