14 Commits (5646b7d2c2fc2afbc0a1e13ef8898b01680cf2f1)

Author SHA1 Message Date
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 8 years ago
  nihuini dd0ae756de batchnorm and scale on vector and image, fix #331 8 years ago
  nihui 08e261f423 innerproduct produce continous blob, fix #236 8 years ago
  nihuini a84ba8fc0f element type storage support in Mat, move data member the first so that a pointer to Mat is a pointer to data, convenient index access for float vector 8 years ago
  nihui a181d25098 new model load api, fix #215 8 years ago
  nihuini 25f19c2009 implement external scale blob, support SENet 8 years ago
  nihui 15ad4dfb9f forward reuse forward_inplace routine, reduce binary size with little memcpy overhead in non-light mode 8 years ago
  nihui d6c833a715 scale_data is vec 8 years ago
  nihui 1e2265dd99 new param load api 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago