Kenji Mouri
|
d802acd205
|
Add SSE and AVX implementation of atan2 in x86 targets. (#4633)
|
3 years ago |
nihui
|
5b5e9ea537
|
fix some pnnx build warnings (#4634)
|
3 years ago |
nihui
|
f8e32aba9c
|
fix pnnx gru rnn with optional output, fix #4608 (#4631)
|
3 years ago |
張小凡
|
d87e895a1f
|
Add get_gpu_instance() function and Organized the instance class codes. (#4630)
|
3 years ago |
nihui
|
8066c76bc5
|
pnnx complex data type and torch.stft family (#4627)
|
3 years ago |
Darren Cheng
|
176f8e12cb
|
remove platform.h.in aarch64 judgment (#4628)
|
3 years ago |
張小凡
|
772b13a1d1
|
Add three extension capability support check (#4626)
* Add some extension capability for vma
|
3 years ago |
Kenji Mouri
|
1936142aae
|
Add AVX and AVX512F implementation of asin, acos and atan in x86 targets. Fix typo for SSE2 implementation of asin in x86 targets. (#4621)
|
3 years ago |
caofx0418
|
643107533a
|
fix errors while build in Android Open Source Code (#4622)
|
3 years ago |
Kenji Mouri
|
5ca5209cd5
|
Add FMA optimization for SSE2 implementation of asin, acos and atan in x86 targets. (#4620)
|
3 years ago |
AlOa
|
22e86402f7
|
Read image from memory buffer for simpleocv (#4557)
|
3 years ago |
nihui
|
db628b1b99
|
allow overwriting built-in layer with custom layer (#4616)
|
3 years ago |
nihui
|
1133a18ca8
|
x86 and arm optimization for convolution1d packed unified elempack (#4615)
|
3 years ago |
張小凡
|
868ea52bea
|
update faq.md about gpu performance (#4614)
|
3 years ago |
nihui
|
85991e2e0e
|
test custom option, update ci (#4609)
* early return for cpu test
* make nvidia driver happy
* fix gemm x86 threading
|
3 years ago |
nihui
|
f34becf6fc
|
fix divide by zero in get optimal tile size mnk (#4610)
|
3 years ago |
nihui
|
2ce77ba918
|
fix mha gemm allocator race (#4611)
|
3 years ago |
nihui
|
804ac3421d
|
infrastructure and optimization for a53 and a55 (#4596)
* new api for detecting arm midr and a53 a55 arch info wrapper
* let a35 be a53 :P
* a53 bf16s
* detect running core
|
3 years ago |
Zhuo Zhang
|
a124c2a839
|
fix typos in citation and benchmark docs (#4604)
|
3 years ago |
inisis
|
f7de5a7dc2
|
update faq.md (#4584)
|
3 years ago |
nihui
|
a961ab992e
|
arm deconv matmul use gemm (#4594)
* arm deconv matmul use gemm
* reduce gemm armv7 register uses
|
3 years ago |
nihui
|
254eb8d0d4
|
blacklist fp16a on old adreno driver (#4587)
|
3 years ago |
nihui
|
06b97d7e69
|
fix exynos 9810 isa detection (#4585)
|
3 years ago |
nihui
|
5ac17df797
|
arm optimization for packed convolution unified elempack (#4590)
|
3 years ago |
nihui
|
8049623d31
|
pnnx convert torch.mm (#4589)
|
3 years ago |
inisis
|
37042b2174
|
update build doc for Centos users (#4583)
|
3 years ago |
dependabot[bot]
|
2f193ebe8f
|
Bump pypa/cibuildwheel from 2.12.0 to 2.12.1 (#4568)
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel) from 2.12.0 to 2.12.1.
- [Release notes](https://github.com/pypa/cibuildwheel/releases)
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md)
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.12.0...v2.12.1)
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
3 years ago |
nihui
|
010d6772d6
|
softmax arm unified elempack and bf16/fp16 optimization (#4582)
* mha arm use softmax fp16
|
3 years ago |
nihui
|
c777bf09dc
|
arm convolution sgemm unified elempack (#4572)
* fuse im2col and packb tile
|
3 years ago |
nihui
|
6987efd950
|
fix scale avx512 (#4580)
|
3 years ago |
nihui
|
693535afc1
|
pnnx torch 2.0 (#4579)
* fix build with torch-2.0
* torch 2.0 new patterns
* add torch 2.0 ci
|
3 years ago |
Kenji Mouri
|
47879ea7ea
|
Add SSE2 implementation of atan in x86 targets. (#4575)
|
3 years ago |
Kenji Mouri
|
b314b3543d
|
Add SSE2 implementation of acos in x86 targets. (#4573)
|
3 years ago |
Kenji Mouri
|
328d2ca2c4
|
Add SSE2 implementation of asin in x86 targets. (#4570)
|
3 years ago |
Yoh
|
7573faae52
|
move floor and ceil sse_function from unaryOp to sse_mathfun (#4566)
|
3 years ago |
wzyforgit
|
6109669583
|
Update benchmark of 3A5000 (#4563)
|
3 years ago |
nihui
|
dabc4c065f
|
arm convolution winograd unified elempack (#4556)
* update f43 coeffs
* arm convolution winograd unified elempack
* disable bf16s test atm
* test gnu inline asm off
|
3 years ago |
nihui
|
6f08ec7397
|
use full date for macos pypi package (#4552)
* use full date for pypi package
* split version date string only for dylib
|
3 years ago |
nihui
|
ae4f630467
|
pnnx fuse multiheadattention (#4544)
* torch baddbmm
* always convert to fp32 for shape inference
* silence info on nonetype and devicetype
|
3 years ago |
dependabot[bot]
|
626aa7cb31
|
Bump actions/checkout from 2 to 3 (#4550)
Bumps [actions/checkout](https://github.com/actions/checkout) from 2 to 3.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/v2...v3)
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
3 years ago |
nihui
|
c68266efd0
|
pnnx eliminate reshape shape expression for only one dynamic dimsize (#4548)
|
3 years ago |
WuJinxuan
|
ff80ac2955
|
[ARM] Multiheadattention (#4463)
|
3 years ago |
nihui
|
bbc770079e
|
silence fopen error on sysfs cache files
|
3 years ago |
nihui
|
6f661f9bc4
|
Update FAQ-ncnn-throw-error.md
|
3 years ago |
tpoisonooo
|
0676684f13
|
Create CITATION.cff (#4526)
Co-authored-by: nihui <shuizhuyuanluo@126.com>
|
3 years ago |
nihui
|
afc9310c62
|
update new operators for modelwriter (#4540)
|
3 years ago |
nihui
|
3e09196237
|
ci create release (#4539)
* update ci create release
* update macos ci image
* install 32bit sdk
|
3 years ago |
nihui
|
47ea2877ed
|
stb and emsdk update (#4536)
* stb_image_write 1.16
* stb_image v2.28
* update emsdk 3.1.28
* enable stb arm neon
* update doc
Co-authored-by: ncnnnnn <67086033+ncnnnnn@users.noreply.github.com>
|
3 years ago |
nihui
|
d0c2738043
|
update riscv winograd f43 coeffs and fix some warnings (#4537)
* update winograd f43 coeffs
* rvv tanh rework
* fix warnings
* rebuild qemu
|
3 years ago |
WuJinxuan
|
6572da3533
|
[x86] GroupNorm (#4471)
Co-authored-by: EdVince <EdVince@users.noreply.github.com>
|
3 years ago |