2886 Commits (d802acd205f8fbaba4edfbe1fd4fca0595f97784)
Author SHA1 Message Date
  Kenji Mouri d802acd205
Add SSE and AVX implementation of atan2 in x86 targets. (#4633) 3 years ago
  nihui 5b5e9ea537
fix some pnnx build warnings (#4634) 3 years ago
  nihui f8e32aba9c
fix pnnx gru rnn with optional output, fix #4608 (#4631) 3 years ago
  張小凡 d87e895a1f
Add get_gpu_instance() function and Organized the instance class codes. (#4630) 3 years ago
  nihui 8066c76bc5
pnnx complex data type and torch.stft family (#4627) 3 years ago
  Darren Cheng 176f8e12cb
remove platform.h.in aarch64 judgment (#4628) 3 years ago
  張小凡 772b13a1d1
Add three extension capability support check (#4626) 3 years ago
  Kenji Mouri 1936142aae
Add AVX and AVX512F implementation of asin, acos and atan in x86 targets. Fix typo for SSE2 implementation of asin in x86 targets. (#4621) 3 years ago
  caofx0418 643107533a
fix errors while build in Android Open Source Code (#4622) 3 years ago
  Kenji Mouri 5ca5209cd5
Add FMA optimization for SSE2 implementation of asin, acos and atan in x86 targets. (#4620) 3 years ago
  AlOa 22e86402f7
Read image from memory buffer for simpleocv (#4557) 3 years ago
  nihui db628b1b99
allow overwriting built-in layer with custom layer (#4616) 3 years ago
  nihui 1133a18ca8
x86 and arm optimization for convolution1d packed unified elempack (#4615) 3 years ago
  張小凡 868ea52bea
update faq.md about gpu performance (#4614) 3 years ago
  nihui 85991e2e0e
test custom option, update ci (#4609) 3 years ago
  nihui f34becf6fc
fix divide by zero in get optimal tile size mnk (#4610) 3 years ago
  nihui 2ce77ba918
fix mha gemm allocator race (#4611) 3 years ago
  nihui 804ac3421d
infrastructure and optimization for a53 and a55 (#4596) 3 years ago
  Zhuo Zhang a124c2a839
fix typos in citation and benchmark docs (#4604) 3 years ago
  inisis f7de5a7dc2
update faq.md (#4584) 3 years ago
  nihui a961ab992e
arm deconv matmul use gemm (#4594) 3 years ago
  nihui 254eb8d0d4
blacklist fp16a on old adreno driver (#4587) 3 years ago
  nihui 06b97d7e69
fix exynos 9810 isa detection (#4585) 3 years ago
  nihui 5ac17df797
arm optimization for packed convolution unified elempack (#4590) 3 years ago
  nihui 8049623d31
pnnx convert torch.mm (#4589) 3 years ago
  inisis 37042b2174
update build doc for Centos users (#4583) 3 years ago
  dependabot[bot] 2f193ebe8f
Bump pypa/cibuildwheel from 2.12.0 to 2.12.1 (#4568) 3 years ago
  nihui 010d6772d6
softmax arm unified elempack and bf16/fp16 optimization (#4582) 3 years ago
  nihui c777bf09dc
arm convolution sgemm unified elempack (#4572) 3 years ago
  nihui 6987efd950
fix scale avx512 (#4580) 3 years ago
  nihui 693535afc1
pnnx torch 2.0 (#4579) 3 years ago
  Kenji Mouri 47879ea7ea
Add SSE2 implementation of atan in x86 targets. (#4575) 3 years ago
  Kenji Mouri b314b3543d
Add SSE2 implementation of acos in x86 targets. (#4573) 3 years ago
  Kenji Mouri 328d2ca2c4
Add SSE2 implementation of asin in x86 targets. (#4570) 3 years ago
  Yoh 7573faae52
move floor and ceil sse_function from unaryOp to sse_mathfun (#4566) 3 years ago
  wzyforgit 6109669583
Update benchmark of 3A5000 (#4563) 3 years ago
  nihui dabc4c065f
arm convolution winograd unified elempack (#4556) 3 years ago
  nihui 6f08ec7397
use full date for macos pypi package (#4552) 3 years ago
  nihui ae4f630467
pnnx fuse multiheadattention (#4544) 3 years ago
  dependabot[bot] 626aa7cb31
Bump actions/checkout from 2 to 3 (#4550) 3 years ago
  nihui c68266efd0
pnnx eliminate reshape shape expression for only one dynamic dimsize (#4548) 3 years ago
  WuJinxuan ff80ac2955
[ARM] Multiheadattention (#4463) 3 years ago
  nihui bbc770079e
silence fopen error on sysfs cache files 3 years ago
  nihui 6f661f9bc4
Update FAQ-ncnn-throw-error.md 3 years ago
  tpoisonooo 0676684f13
Create CITATION.cff (#4526) 3 years ago
  nihui afc9310c62
update new operators for modelwriter (#4540) 3 years ago
  nihui 3e09196237
ci create release (#4539) 3 years ago
  nihui 47ea2877ed
stb and emsdk update (#4536) 3 years ago
  nihui d0c2738043
update riscv winograd f43 coeffs and fix some warnings (#4537) 3 years ago
  WuJinxuan 6572da3533
[x86] GroupNorm (#4471) 3 years ago