40 Commits (4d2d625432e8fdaaaa33042f31ceb6071eef6809)

Author SHA1 Message Date
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihuini 0d6cc01d55 innerproduct handle mish activation, fix naive C testing, fix #1930 5 years ago
  nihui b5e288b521 layer creator function is not necessary for built-in layers 5 years ago
  Tijmen Verhulsdonck a91a18b901
AVX innerproduct and pooling 2x2 versions (#1839) 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  Xu Yang dbd9cbab4a
fix layer innerproduct when build with requant option on (#1624) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  Sungmann Cho c62e2702b3 Fix warnings on Visual Studio (#1456) 6 years ago
  nihuini 336d1c1edd remove the ncnn namespace for in source Option 6 years ago
  nihuini 567e2bd501 a dirty hack for resolving int8 pack4 crash 6 years ago
  nihuini cd4be6d0fa call vulkan create_pipeline on the vkdev condition, drop opt_cpu hacks 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihui 3e003ffd98 fuse sigmoid 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihui be81ecf1f6 fix build on msvc 7 years ago
  nihuini 528fe8e9e3 gpu convolution/deconvolution/innerproduct fuse activation 7 years ago
  nihuini 3f85cafc08 fuse relu leakyrelu clip into convolution/deconvolution/innerproduct 7 years ago
  nihuini 433a92401a auto barrier in pipeline and copy command 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui cc4376d8e6 do not upload unnecessary pack1 weight, reduce gpu memory usage 7 years ago
  nihuini e213605cd4 reduce memory usage of weight packing 7 years ago
  nihui 960ffa1a50 optimize workgroup size for convolution depthwise and innerproduct pack4 7 years ago
  nihui a15b389d86 fix innerproduct pack1to4 pack4to1 weight upload 7 years ago
  nihui 9480dcbc36 fix innerproduct out packing 7 years ago
  nihui f9dc551081 add innerproduct pack1to4 pack4to1 glue code 7 years ago
  nihui 303996af4c auto flatten before innerproduct 7 years ago
  nihui f0b4933eac
massive simd optimize in compute shader (#772) 7 years ago
  nihui 10b8ac68cc
[WIP] vulkan compute (#618) 7 years ago
  BUG1989 7d2d18d31f innerproduce layer with int8 impl,the type of top_blob shoud be integer. (#578) 7 years ago
  nihuini 23de61fd07 as we already have the int8_scale_term switch, do not have to rely on the actual scale value 7 years ago
  nihuini 2dbaf6f7b7 store int8 scale in binary 7 years ago
  nihui 5d04a3a45c layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
  nihui a169cec363 core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihui 08e261f423 innerproduct produce continous blob, fix #236 8 years ago
  nihuini a84ba8fc0f element type storage support in Mat, move data member the first so that a pointer to Mat is a pointer to data, convenient index access for float vector 8 years ago
  nihui a181d25098 new model load api, fix #215 8 years ago
  nihui 1e2265dd99 new param load api 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago