2834 Commits (ae45d2860aeba8b32ef2bc187e62cdb45682cb48)
 

Author SHA1
Message Date
  tuduweb ae45d2860a
Correct README links about build and Add compatibility introduction (#4530) 3 years ago
  nihui 1832da8292
concat 4d (#4528) 3 years ago
  nihui fb9cf7982d
eltwise 4d (#4529) 3 years ago
  nihui 32e2de015e
slice 4d (#4525) 3 years ago
  nihui fc6ce4a641
copyto operator (#4522) 3 years ago
  nihui 242e775d21
pnnx convert torch log10, pow 2 as square (#4518) 3 years ago
  nihui 246e71c526
implement atan2 (#4516) 3 years ago
  Fangjun Kuang 92e75105c9
Support torch.cumsum (#4505) 3 years ago
  nihui ab4cfbf5b0
enrich ncnn binary broadcast rules (#4513) 3 years ago
  Yoh 529c83b8a5
Fuse pixel unshuffle (#4504) 3 years ago
  nihui 6869c81ed3
find cpu cache size from sysfs (#4502) 3 years ago
  nihui 17197b3c45
ci build with musl libc (#4499) 3 years ago
  nihui ce6b80a16b
pnnx flatten input tuple list (#4498) 3 years ago
  向阳 6af0f3777f
Remove redundant include (#4497) 3 years ago
  nihui 3b36656bc8
reduce vulkan winograd f43 transform shader register pressure (#4496) 3 years ago
  nihui dfbcd3e69b
improve vulkan winograd f43 fp16 numerical stability (#4492) 3 years ago
  weirdseed 503a8b921f
fix uninitialized gpu bug_buffer_image_load_zero value (#4493) 3 years ago
  nihui d2d012dce5
x86 bfloat16 cast functions (#4491) 3 years ago
  dependabot[bot] f046181766
Bump pypa/cibuildwheel from 2.11.4 to 2.12.0 (#4477) 3 years ago
  Hitesh Kumar add0a7bac4
fix : minor typo readme (#4486) 3 years ago
  nihui fed99fd35b
gemm output transpose, prepack c (#4479) 3 years ago
  li mengyang ff4d05a713
fix typo (#4482) 3 years ago
  nihui 2e3e680d77
x86 optimization for packed convolution unified elempack (#4469) 3 years ago
  nihui bd5bbe3f2c
x86 optimization for winograd unified elempack part2 (#4470) 3 years ago
  ws 643285a08c
fix macos vulkan instance create failed when vulkan sdk version >= 1.… (#4472) 3 years ago
  nihui 88274827da
x86 optimization for winograd unified elempack (#4456) 3 years ago
  nihui f0a91f46f5
update macos ci xcode version and vulkan sdk (#4465) 3 years ago
  WuJinxuan ad956c8c9c
[ARM] GELU (#4464) 3 years ago
  wyushun 68de8a2128
fix output_indexes name (#4453) 3 years ago
  Jiao Dian's Power Plant b07c5fc811
Remove unused imports in python (#4378) 3 years ago
  WuJinxuan 10e9d91576
Add x86 MultiHeadAttention (#4443) 3 years ago
  nihui 15761fc1a6
arm vfpv4 asimdhp asimdfhm optimization for gemm (#4432) 3 years ago
  nihui c471826da1
fix arm bfloat2float float2bfloat oops (#4439) 3 years ago
  dependabot[bot] b5884827d6
Bump pypa/cibuildwheel from 2.11.3 to 2.11.4 (#4438) 3 years ago
  nihui 88dba58992
fix gemm transpose B wrong result when tile N is not a multiple of 4, optimize load C (#4430) 3 years ago
  Yoh 0fffefd3c9
fix crop bug and eliminate Tensor.clone (#4416) 3 years ago
  nihui 7b3261dace
gemm arm optimization (#4426) 3 years ago
  inisis 62fc16d157
pnnx readme remove duplicate space (#4428) 3 years ago
  Fangjun Kuang 607c8f8332
Update README to include sherpa-ncnn for real-time speech recognition (#4424) 3 years ago
  mizu-bai c4574586ca
Add Example ncnn-fortran (#4423) 3 years ago
  nihui 5da70724b1
matmul x86 use sgemm (#4421) 3 years ago
  tpoisonooo edb70f5b35
Update README.md (#4419) 3 years ago
  tpoisonooo 8fea27fbb5
Update model-convert.md (#4352) 3 years ago
  wzyforgit e06081308b
Flush benchmark of some CPU model by tag 20221128 (#4418) 3 years ago
  nihui 1f1981052c
convolution deconvolution and deformableconv2d x86 use sgemm (#4414) 3 years ago
  nihui 9cc6eb1942
meet gemm x86 transpose alignment 3 years ago
  nihui 18fbaebe68
get cpu l2 cache size and resolve gemm tile size (#4411) 3 years ago
  nihui c5640a16c3
gemm x86 multiply alpha beta in post gemm stage, enable one_blob_only (#4407) 3 years ago
  nihui d48f712599
force NxK size the multiple of native simd length to fix mis-alignment 3 years ago
  nihui 2f8d1d4f9e
fix gemm x86 transpose b pack4 mis-alignment 3 years ago