685 Commits (a3a2548aa28a9ff7924f76d5085fabe94de79ddb)

Author SHA1 Message Date
  nihuini ff0e8c85c5 bind the same pipeline may cause driver incorrectly optimize into one, use two pipelines to always change the current one 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihuini 4ac56d3c1c unified memory index is not mandatory, sanity check 7 years ago
  nihuini 5f0ee22a33 treat as unified memory architecture if memory heap is same 7 years ago
  nihui 253bef2f7b example now runs on gpu when vulkan enabled 7 years ago
  nihui d85775fbcd fix softmax axis order on 3-dim, fix caffe reshape conversion, regenerate ssd param 7 years ago
  nihui 979ed57487 packing param for identity packing when padding disabled, auto packing conversion between cpu and gpu blob 7 years ago
  nihui b49cb56ad9 constify vulkan device handle, use default local vulkan device if not specified 7 years ago
  nihui 5e07749a4a do not emit upload transfer on unified memory 7 years ago
  nihui 182c340b3a enable ssd vulkan benchmark 7 years ago
  nihui 9ebac3fe9e dedicated reference counter for staging data 7 years ago
  nihui 68afd1fa17 reset fence 7 years ago
  nihui 81ee56b209 copy buffer has offset alignment limit, re-implement concat as compute pipeline 7 years ago
  nihuini f162de7263 drop deprecated hack 7 years ago
  nihuini 83efa73cf6 fallback to cpu forward if layer not support vulkan, automatically! 7 years ago
  nihuini bdd305638d command reset 7 years ago
  nihuini 10a088397e concat interleave image row 7 years ago
  nihuini 1ace8068e3 zero detected is not error 7 years ago
  nihuini ab4c94aea9 fix cpu-only build 7 years ago
  nihuini b54e115f6e enable mobilenet-yolo mobilenet-yolov3 vulkan benchmark 7 years ago
  nihuini 14efdd8e00 reorg shader 7 years ago
  nihui 723d326760 enable shufflenet and vgg16 in vulkan benchmark 7 years ago
  nihui b62e9c4b1e shufflechannel shader 7 years ago
  nihuini bb04055e80 permute shader 7 years ago
  nihui 24f423b0c6 fix build on msvc 7 years ago
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihui 0ad0c07526 drop duplicated weight data in convolution-fc, use the more light-weight pipelines 7 years ago
  nihuini 43c4b57201 group deconvolution packing family 7 years ago
  nihuini 8547864b6f group convolution packing family 7 years ago
  nihuini 675fcc72a5 interp vulkan 7 years ago
  nihuini 37413ea95c implement depthwise deconvolution vulkan, fix top blob state 7 years ago
  nihuini 468516879f implement deconvolution vulkan family support 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihuini 7312887671 transfer command hold data context 7 years ago
  nihuini 4a57f88c3c vkcompute auto begin end, use proper alignment for vktransfer staging buffer offset 7 years ago
  nihuini 39f2c71d5b fix name conflict on ios 7 years ago
  nihui f4e12101c0 fix convolution typed innerproduct pack4 7 years ago
  nihui 0acdbebf3b merge refcount into buffer memory cookie 7 years ago
  nihui 960ffa1a50 optimize workgroup size for convolution depthwise and innerproduct pack4 7 years ago
  nihui a15b389d86 fix innerproduct pack1to4 pack4to1 weight upload 7 years ago
  Emmanuel Benazera a8fd79e1bc fixed cell initialization in LSTM layer 7 years ago
  nihui 62543f9b1e flatten pack1to4 7 years ago
  nihui 9480dcbc36 fix innerproduct out packing 7 years ago
  nihui f9dc551081 add innerproduct pack1to4 pack4to1 glue code 7 years ago
  nihui 3f91d6b529 add innerproduct pack1to4 pack4to1 shader 7 years ago
  nihui cd7f120250 lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
  nihui 7ee3216fff add convolution pack1to4 pack4to1 7 years ago
  nihui 9d2b345eab lrn region within channel pack4 7 years ago
  nihui ad68e1e0e6 enable googlenet alexnet vulkan benchmark, fix build on msvc 7 years ago
  nihui 559183904b fix random crash on dedicated allocation 7 years ago