312 Commits (5646b7d2c2fc2afbc0a1e13ef8898b01680cf2f1)

Author SHA1 Message Date
  nihuini 5646b7d2c2 flatten image 7 years ago
  nihuini 2e939fab0f fix memleak 7 years ago
  nihui dd83284cee prelu shader 7 years ago
  nihui e50b339f04 clip shader 7 years ago
  nihui 69788b0467 reshape shader family 7 years ago
  nihui c41bcd98a3 priorbox shader, fix permute order 1 on image, fix potential staging memory leak 7 years ago
  nihuini 0956d26df1 add absval sigmoid tanh shader 7 years ago
  nihuini b5faa0e519 respect pad param in deconv vulkan 7 years ago
  BUG1989 8e337d440e fix the bug with convdw7x7 op working on int8 mode (#818) 7 years ago
  nihuini 4bc543d85c crop shader 7 years ago
  nihuini 9787625e4b warn users about the old wrong softmax behavior on axis not zero 7 years ago
  nihuini c54e57ed6f Merge branch 'master' of https://github.com/Tencent/ncnn 7 years ago
  nihuini 85a28959e4 fix binaryop shader binding, use shared buffer state, fix blob copy in non-light mode, fix #817 7 years ago
  BUG1989 8ff831f7cd fix the segmentation fault when load int8 model (#811) 7 years ago
  nihuini ff0e8c85c5 bind the same pipeline may cause driver incorrectly optimize into one, use two pipelines to always change the current one 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui d85775fbcd fix softmax axis order on 3-dim, fix caffe reshape conversion, regenerate ssd param 7 years ago
  nihui 979ed57487 packing param for identity packing when padding disabled, auto packing conversion between cpu and gpu blob 7 years ago
  nihui b49cb56ad9 constify vulkan device handle, use default local vulkan device if not specified 7 years ago
  nihui 81ee56b209 copy buffer has offset alignment limit, re-implement concat as compute pipeline 7 years ago
  nihuini 83efa73cf6 fallback to cpu forward if layer not support vulkan, automatically! 7 years ago
  nihuini 10a088397e concat interleave image row 7 years ago
  nihuini 1ace8068e3 zero detected is not error 7 years ago
  nihuini 14efdd8e00 reorg shader 7 years ago
  nihui b62e9c4b1e shufflechannel shader 7 years ago
  nihuini bb04055e80 permute shader 7 years ago
  nihui 24f423b0c6 fix build on msvc 7 years ago
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihui 0ad0c07526 drop duplicated weight data in convolution-fc, use the more light-weight pipelines 7 years ago
  nihuini 43c4b57201 group deconvolution packing family 7 years ago
  nihuini 8547864b6f group convolution packing family 7 years ago
  nihuini 675fcc72a5 interp vulkan 7 years ago
  nihuini 37413ea95c implement depthwise deconvolution vulkan, fix top blob state 7 years ago
  nihuini 468516879f implement deconvolution vulkan family support 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihuini 39f2c71d5b fix name conflict on ios 7 years ago
  nihui f4e12101c0 fix convolution typed innerproduct pack4 7 years ago
  nihui 960ffa1a50 optimize workgroup size for convolution depthwise and innerproduct pack4 7 years ago
  nihui a15b389d86 fix innerproduct pack1to4 pack4to1 weight upload 7 years ago
  Emmanuel Benazera a8fd79e1bc fixed cell initialization in LSTM layer 7 years ago
  nihui 62543f9b1e flatten pack1to4 7 years ago
  nihui 9480dcbc36 fix innerproduct out packing 7 years ago
  nihui f9dc551081 add innerproduct pack1to4 pack4to1 glue code 7 years ago
  nihui 3f91d6b529 add innerproduct pack1to4 pack4to1 shader 7 years ago
  nihui cd7f120250 lrn norm across channel pack4, rename member name with pipeline prefix 7 years ago
  nihui 7ee3216fff add convolution pack1to4 pack4to1 7 years ago
  nihui 9d2b345eab lrn region within channel pack4 7 years ago
  nihui ad68e1e0e6 enable googlenet alexnet vulkan benchmark, fix build on msvc 7 years ago
  nihui f9ea621305 pooling full padding 7 years ago
  nihui ee59f14900 add lrn shader 7 years ago