44 Commits (2bc77e7487d07a40667fcf9f8fffa17ca75e0523)

Author SHA1 Message Date
  nihui f10cc6dd93 initial data structure changes for 3dcnn, conv3d, pooling3d (#3378) 4 years ago
  nihui f86a307ab5 silence code scanning 4 years ago
  nihui 6a52e8e5f2 fix potential integer type overflow 4 years ago
  nihuini 6c2cee8186 fix mat clone with atypical source cstep 4 years ago
  nihuini 3631c1933d non-inlined addref and release slows down overall speed, move them to header 5 years ago
  nihui d7cbc055f3 fix illegal instruction on pi4 when NCNN_ARM82 enabled 5 years ago
  nihui 5fe75f19ef architecture changes for int8 packing (#2771) 5 years ago
  restyled-io[bot] 5f00ba89d2 feat(ncnnoptimize): replace denormals to zero on layers with weights (#2690) 5 years ago
  nihui 79efe33fdc cmake option for platform api uses (#2502) 5 years ago
  nihui e644164873 reshape arm bf16s fp16s, flatten api 5 years ago
  nihui 7f5047d1dc Ci test end2end squeezenet (#1919) 5 years ago
  nihui 3ef995ed1e format code style and setup restyled.io (#1840) 6 years ago
  tpoisonooo 8e1c3ac4d1 Add crop para check (#1825) 6 years ago
  Naiyang Lin ceef2470a5 Add logger.h (#1753) 6 years ago
  nihui 62da1228e1 adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihuini ee118e7d70 reconstruct import android hardwarebuffer api, wip 6 years ago
  nihui 44eb28fadc fix cast arm packing test 6 years ago
  nihui f214883203 cast between float32 and bfloat16 6 years ago
  nihui 7ae585f217 shape hint is elemsize aware 6 years ago
  nihui 0f7e7bca02 shader shape specialization constant and basic local group size partition (#1523) 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihuini a86c2f44c3 vkimagemat, vkimageallocator, convenient construct from android hardware buffer 6 years ago
  nihuini a170ef1acf remove the default option usage in layer interface, fix write out of range in cast arm pack4, handle fp16p conversion on cpu/gpu transfer 6 years ago
  nihui b4c388a72a Mat misc function accept option parameter, deconvolution pack4 arm neon 6 years ago
  nihuini c4f23ae8ad rename Mat packing to elempack 6 years ago
  nihuini 838c5df839 option api changes 7 years ago
  nihuini dfffb29bb5 resize bicubic 7 years ago
  nihuini a4b74d27b0 move copy cut border function to operator 7 years ago
  nihuini 5a905c7cb9 implement substract_mean_normalize with bias and scale op 7 years ago
  nihuini c25c190703 move resize bilinear function to operator 7 years ago
  nihuini 43737b378f wrapper function for converting between fp32 and fp16 7 years ago
  nihui 8e5674363b element packing (#770) 7 years ago
  nihuini bf1c58be46 padding is elemsize aware, copy_make_border is now a padding wrapper 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihuini ee98817446 proper first row/col handling in resize family, fix #429 8 years ago
  dong 6ea09ebf2c Use aarch64 assembly to replace arm intrinsics 8 years ago
  nihuini d2ee4e7d27 ld1 and st1 handle data endian mode per element 8 years ago
  nihuini a84ba8fc0f element type storage support in Mat, move data member the first so that a pointer to Mat is a pointer to data, convenient index access for float vector 8 years ago
  peng 39445b5233 no memcpy for small size copy_cut_border/copy_make_boder 8 years ago
  彭 a86cc8f620 memcpy optimize copy_cut_border/copy_make_boder (#179) 8 years ago
  nihui 908a8f48d2 assign same size 8 years ago
  nihuini 0edd2b78c5 arm neon optimize for bilinear_resize, about 40% faster 8 years ago
  nihuini 613028aa17 implement Mat resize_bilinear 8 years ago
  nihuini b7db8be4f6 add ncnn source qwq 9 years ago