46 Commits (cd4be6d0fadd6d01635a4fd3934d97e90e6f71ff)

Author SHA1 Message Date
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  nihuini c0a4ffcf66 convolution pad_value param 6 years ago
  nihuini 296e0022df deconvolution output adj and output shape 6 years ago
  nihuini 0e26e3094e autopad SAME_LOWER 6 years ago
  nihuini 9a6ee37eef asymmetric padding parameter for convolution and deconvolution family 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  tpoisonooo 1ca4387c9c Auto choose conv implementation (#1085) 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 528fe8e9e3 gpu convolution/deconvolution/innerproduct fuse activation 7 years ago
  nihuini 3f85cafc08 fuse relu leakyrelu clip into convolution/deconvolution/innerproduct 7 years ago
  nihui 274392eb80 convolution padding same on gpu 7 years ago
  BUG1989 780c7d9a72 merge de/requantize op, optimize some int8 conv layer on arm64-v8a (#867) 7 years ago
  nihuini b2e41bf83d fallback convolution to cpu path for pad -233 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  BUG1989 8e337d440e fix the bug with convdw7x7 op working on int8 mode (#818) 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihui 0ad0c07526 drop duplicated weight data in convolution-fc, use the more light-weight pipelines 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihui f4e12101c0 fix convolution typed innerproduct pack4 7 years ago
  nihui 7ee3216fff add convolution pack1to4 pack4to1 7 years ago
  nihui ad68e1e0e6 enable googlenet alexnet vulkan benchmark, fix build on msvc 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  nihuini 23de61fd07 as we already have the int8_scale_term switch, do not have to rely on the actual scale value 7 years ago
  nihuini 6f1b0b0a61 quantized padding in convolution, use range sweets 7 years ago
  nihuini 2dbaf6f7b7 store int8 scale in binary 7 years ago
  nihui fe14037777 more sub op preload 7 years ago
  nihui 2fe7ada4d8 add arm int8 convolution stub, preload group op for x86 7 years ago
  nihui 5d04a3a45c layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
  nihui a169cec363 core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihui 7d1e49584d call Innerproduct for convolution on flattened blob 8 years ago
  nihuini a84ba8fc0f element type storage support in Mat, move data member the first so that a pointer to Mat is a pointer to data, convenient index access for float vector 8 years ago
  nihui a181d25098 new model load api, fix #215 8 years ago
  nihui bdb70a2010 padding w h in convolution and deconvolution 8 years ago
  nihui 44b4519307 non-square convolution and deconvolution kernel stride dilation 8 years ago
  nihui 1e2265dd99 new param load api 8 years ago
  nihuini 47218db6e5 fix minus padding SAME, fix #116 8 years ago
  nihuini 23630b14b9 implement tensorflow style padding SAME type for convolution and pooling, second try 8 years ago
  nihuini 320cbca902 implement tensorflow style padding SAME type for convolution and pooling 8 years ago
  nihuini 9bba77aa8e fix dilation convolution, fix #64 fix #75 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago