150 Commits (2bc77e7487d07a40667fcf9f8fffa17ca75e0523)

Author SHA1 Message Date
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini b81e1f3906 get rid of the old workaround :) 7 years ago
  nihuini 4729ea3505 bottom blob memory never alias, reuse blob memory more elegantly relying on refcount 7 years ago
  nihui 8724440c59 bind wait barrier count member to memory, fix #932 7 years ago
  nihui 162c46647d do not create fp16 shader module on unsupported platform 7 years ago
  nihui d753fe2589 upload fp16 weight, enable fp16 storage and arithmetic 7 years ago
  Gemfield add8c73922 Fix the return value of load_param and load_model (#855) 7 years ago
  Gemfield 573c2bcd93 Fix crash issue during load_model (#848) 7 years ago
  nihui caeb85d6cd multithreaded pipeline creation and destruction may cause driver crash :( 7 years ago
  nihuini b2e41bf83d fallback convolution to cpu path for pad -233 7 years ago
  nihuini d999f43b87 fix vulkan initialization using memory loading 7 years ago
  nihuini d263cd507c gpu packing and unpacking 7 years ago
  nihuini d3a11eb6c9 one codepath for unified and discrete device 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  nihuini 1f4bdd91b5 uint32_t typed workgroup size 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui 979ed57487 packing param for identity packing when padding disabled, auto packing conversion between cpu and gpu blob 7 years ago
  nihui b49cb56ad9 constify vulkan device handle, use default local vulkan device if not specified 7 years ago
  nihui 5e07749a4a do not emit upload transfer on unified memory 7 years ago
  nihui 9ebac3fe9e dedicated reference counter for staging data 7 years ago
  nihuini 83efa73cf6 fallback to cpu forward if layer not support vulkan, automatically! 7 years ago
  nihuini 4a57f88c3c vkcompute auto begin end, use proper alignment for vktransfer staging buffer offset 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  nihui a577d71c12
Update net.cpp 7 years ago
  nihuini 099189384f fix load_param_bin, fix #732 7 years ago
  nihuini b2ffc339c0 reset internal_nconsumed_ptr before mem_scanf on msvc, fix #706 7 years ago
  Abdel Younes a941701f98 fix: c++ warnings (#666) 7 years ago
  nihuini 4e68a29eff fix build on msvc, second try 7 years ago
  nihuini 4a9e452cb5 fix build on msvc, first try 7 years ago
  xinyu391 666ec2820e add Net::load_param_mem(), ParamDict::load_param_mem() load plain text paramters from memory buf. (#587) 7 years ago
  nihui 5d04a3a45c layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
  nihui 354b95256c bump param version, backward compatible 7 years ago
  nihuini 2bc504925e fix int8_scales from multiple blobs, fix #512 7 years ago
  nihui a169cec363 core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihuini a341e7465c reject to load model with empty network, fix #392 8 years ago
  Hyungsuk Yoon 354f515596 scanf with width specifier need (width + 1) spaces 8 years ago
  nihui 6c4c810fda decouple modelbin of different input types, simplify timestamp function 8 years ago
  nihui d68eb4cd15 wrap benchmark gettimeofday 8 years ago
  Linghan Cheung 811b6ba1b6 print benchmark information for every layer, especially for CONVOLUTION (#241) 8 years ago
  nihui 10b86c2af5 create layer from type name 8 years ago
  nihui 7a43c45e80 remove deprecated code 8 years ago
  nihui a181d25098 new model load api, fix #215 8 years ago
  nihuini b84ba31c23 enable light mode by default 8 years ago
  nihui 1e2265dd99 new param load api 8 years ago
  nihui 87bad3420e improve custom layer register 8 years ago
  nihuini 8eba811e4b move layer type enum to separate header 8 years ago
  tedder59 a26914d6c8 注册用户网络层小bug 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago