42 Commits (1469bc8b19b83d44206f36abfa3dc7377feeef69)

Author SHA1 Message Date
  nihuini 1ea9de3bdf create shader pipeline by type index, resolve binding count and push constant count from spirv. since we don't create compound shader module for macos and ios compatibility, it is enough to use fixed main as the shader entry point 6 years ago
  nihui f972bf49d1 enable bugihfa on rk3288 and rk3399 6 years ago
  nihui 7c97142524 old qcom adreno driver seems to have the same bug as mali does 6 years ago
  nihui bbaa4dcce2 compile fp16pa, optimize shader for size, enable implicit fp16 arithmetic for qcom855 and qcom855plus 6 years ago
  nihuini b361b24832 do not enforce coherent memory type, queue transfer after uploading model weight 6 years ago
  nihui 038666e049
the initial auto test (#1464) 6 years ago
  nihuini a477aee0ba print graphics queue info, const++ 6 years ago
  nihui 8a87f0267a workaround local workgroup size specialization constant bug for old arm mali vulkan driver, fix #1424 6 years ago
  nihui a867d96822 dynamic memory type querying, respect memory requirement memory type bits 6 years ago
  nihui 7e68c5e1e9 enable ycbcr conversion feature, get graphics queue 6 years ago
  nihui cb41b00e6e setup VK_KHR_bind_memory2 functions 6 years ago
  nihui b29e8b0e09 check and enable more vulkan extensions 6 years ago
  nihuini 21b5508c96 shared locked vkallocator cannot prevent concurrent accessing during actual gpu inference, use seperated vkallocator for each queue 7 years ago
  nihuini e9ffdb5bdd 16bit storage on arm mali is buggy 7 years ago
  nihuini 040a8d2427 set vulkan device by gpu index 7 years ago
  nihuini 5fdffbcaac destroy_gpu_instance is not threadsafe anyway, fix deadlock on exit 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini 7f7bbf12e5 new api for getting the default gpu device 7 years ago
  nihuini cd7559c639 more fix for fp16p, still disabled by default 7 years ago
  nihui 25b9736f82 shader fp16 packed 7 years ago
  nihuini 4b50a97e31 implement vulkan winograd23 7 years ago
  nihuini 37e150162a do not retrieve timestamp availabitliy bits 7 years ago
  nihuini 8e2fb2e710 expose timestamp_period and timestamp_valid_bits 7 years ago
  nihuini c9a9486307 merge command submit and wait, expose queue_count, concurrent queue submission shall work 7 years ago
  nihuini 3d06c40d10 fix build with vulkan header version 65, fix #907 7 years ago
  nihui f92dcca3b3 compiled spirv nearly always claim uniform buffer 8bit / 16bit access capability 7 years ago
  nihui c180e87502 add compile shader module function, create pipeline from custom shader spv data 7 years ago
  nihuini 31db9797df interp bicubic shader, initialize mat member with zero 7 years ago
  nihui 162c46647d do not create fp16 shader module on unsupported platform 7 years ago
  nihui 058bd65c88 fix fp16 shader creation 7 years ago
  nihuini 4e3df863d5 fix enable feature pointer 7 years ago
  nihuini 05bf09ba70 rename fp16_storage to support_fp16_storage 7 years ago
  nihuini 332722af63 fix fp16a int8a exchange oops 7 years ago
  nihuini e59dc6fafe proper usage of instance extension VK_KHR_get_physical_device_properties2, check fp16 and int8 feature 7 years ago
  nihuini d933f384b6 bump engine version 7 years ago
  nihuini 038389fa63 blacklist known buggy driver 7 years ago
  nihuini 5646b7d2c2 flatten image 7 years ago
  nihuini 1f4bdd91b5 uint32_t typed workgroup size 7 years ago
  nihuini 532054b453 expose more device info 7 years ago
  nihuini 4ac56d3c1c unified memory index is not mandatory, sanity check 7 years ago
  nihuini 5f0ee22a33 treat as unified memory architecture if memory heap is same 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago