66 Commits (057b5bb515d551fa64decdb7350422c19feba447)

Author SHA1 Message Date
  nihui b853b3d132
get_physical_cpu_count api family (#4302) 3 years ago
  LinHe 9426e21166
Memory Pool Improvement For Variadic Sized Inputs (#4190) 3 years ago
  nihui 1d0917c83b
fix build with very old gcc (#4048) 3 years ago
  xuehao.ma 962a49069a
add the param file of fastestdet in benchmark (#4026) 3 years ago
  tpoisonooo 6fd801b6d7
feat(src/layer): add vision_transformer benchmark (#3730) 4 years ago
  dog-qiuqiu 009d607a15
add the param file of yolo-fastest in benchmark (#3470) 4 years ago
  BUG1989 2112a4d7c3
add the param file of nanodet_m in benchmark (#3047) 4 years ago
  nihuini 26dc9820e4 custom mlir ncnn optimize pass, add efficientnetv2_b0 benchmark 5 years ago
  Cai Shanli 8cc8cd716a
Add get input and output names (#2890) 5 years ago
  nihui e4a4b51d27
openmp on webassembly (#2234) 5 years ago
  Evgeny Proydakov 8b0c46c45d
A single approach was used to suppress the msvc C4996 compiler warning [_CRT_SECURE_NO_WARNINGS] (#2208) 5 years ago
  fawdlstty 1d1cb29869
Fixed compile warning due to default cast (#2201) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui fe6bc1ed4d
Ci rv64gcv and rv64gc (#1936) 5 years ago
  zhiliu6 cdbff653b8
Add yolov4 example option. (#1913) 5 years ago
  nihui 164273de61
online pipeline cache (#1792) 5 years ago
  nihuini 9bb06e46cf implicit gpu instance creation, fix #1849 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  Tijmen Verhulsdonck da09e5e7f1
Adding channel padding support for blazeface model. (#1826) 6 years ago
  nihuini 03a5378651 efficientnet-b0 and regnety-400m benchmark 6 years ago
  nihuini 054ec09195 adreno device blacklist 6 years ago
  nihuini d232272db0 lower end gpu friendly 6 years ago
  nihuini b71f22d074 report adreno info, benchncnn enable image storage on adreno 6 years ago
  nihui 9a9a618229 image storage is mandatory, less options makes life easier 6 years ago
  nihui 62da1228e1
adreno image shader + fp16 + fp16a (#1714) 6 years ago
  kalcohol 06e129d259
add skip cooling down option (#1566) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  Sungmann Cho 447b1369f5 Fix warnings on Visual Studio (#1422) 6 years ago
  nihuini 2e59da35a9 fill input and weight data with zero 6 years ago
  nihuini 02b07b3e43 update qcom810 and iphone5s benchmark 6 years ago
  nihuini 64333429bb data reader wrapper, fix #1325 6 years ago
  nihuini 567e2bd501 a dirty hack for resolving int8 pack4 crash 6 years ago
  nihuini f8caef7691 add shufflenet_v2 benchmark 6 years ago
  nihuini 9d4255a4a4 add mobilenet_v3 benchmark 6 years ago
  Eric Liu f5eee84185 Update mobilenetv2-yolov3 (#1165) 6 years ago
  Natsu 6d1944f2c3 CMake improvement (#1115) 6 years ago
  BUG1989 bcfe9f453f initial the ncnn post training quantization tools (#1067) 7 years ago
  nihuini 21b5508c96 shared locked vkallocator cannot prevent concurrent accessing during actual gpu inference, use seperated vkallocator for each queue 7 years ago
  nihuini 040a8d2427 set vulkan device by gpu index 7 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini 9b33e647bd use fixed blob names for benchmark 7 years ago
  nihuini 7a8f68aca6 move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works 7 years ago
  nihuini 20fb006282 coverage never works without proper unittest 7 years ago
  nihuini d263cd507c gpu packing and unpacking 7 years ago
  BUG1989 2f4c4a8202 fix the compile error when using armv7a without neon (#835) 7 years ago
  nihuini 1f4bdd91b5 uint32_t typed workgroup size 7 years ago
  BUG1989 df3d224484 new int8 implement,better accuracy (#749) 7 years ago
  nihui 182c340b3a enable ssd vulkan benchmark 7 years ago
  nihuini f162de7263 drop deprecated hack 7 years ago
  nihuini 83efa73cf6 fallback to cpu forward if layer not support vulkan, automatically! 7 years ago