nihui
bbc770079e
silence fopen error on sysfs cache files
3 years ago
nihui
6f661f9bc4
Update FAQ-ncnn-throw-error.md
3 years ago
tpoisonooo
0676684f13
Create CITATION.cff ( #4526 )
Co-authored-by: nihui <shuizhuyuanluo@126.com>
3 years ago
nihui
afc9310c62
update new operators for modelwriter ( #4540 )
3 years ago
nihui
3e09196237
ci create release ( #4539 )
* update ci create release
* update macos ci image
* install 32bit sdk
3 years ago
nihui
47ea2877ed
stb and emsdk update ( #4536 )
* stb_image_write 1.16
* stb_image v2.28
* update emsdk 3.1.28
* enable stb arm neon
* update doc
Co-authored-by: ncnnnnn <67086033+ncnnnnn@users.noreply.github.com>
3 years ago
nihui
d0c2738043
update riscv winograd f43 coeffs and fix some warnings ( #4537 )
* update winograd f43 coeffs
* rvv tanh rework
* fix warnings
* rebuild qemu
3 years ago
WuJinxuan
6572da3533
[x86] GroupNorm ( #4471 )
Co-authored-by: EdVince <EdVince@users.noreply.github.com>
3 years ago
nihui
833f6ed8e4
c api for getting output indexes and names ( #4534 )
3 years ago
nihui
4fc977d7fd
fold scalar constant, write scalar attribute correctly ( #4533 )
3 years ago
tuduweb
ae45d2860a
Correct README links about build and Add compatibility introduction ( #4530 )
1. The correctly link is `how-to-build#build-for-linux` not `#pass-for-linux`, and so on other `#`
2. Add compatibility introduction for Raspberry Pi 4 (build pass)
3 years ago
nihui
1832da8292
concat 4d ( #4528 )
3 years ago
nihui
fb9cf7982d
eltwise 4d ( #4529 )
3 years ago
nihui
32e2de015e
slice 4d ( #4525 )
3 years ago
nihui
fc6ce4a641
copyto operator ( #4522 )
3 years ago
nihui
242e775d21
pnnx convert torch log10, pow 2 as square ( #4518 )
3 years ago
nihui
246e71c526
implement atan2 ( #4516 )
3 years ago
Fangjun Kuang
92e75105c9
Support torch.cumsum ( #4505 )
3 years ago
nihui
ab4cfbf5b0
enrich ncnn binary broadcast rules ( #4513 )
3 years ago
Yoh
529c83b8a5
Fuse pixel unshuffle ( #4504 )
3 years ago
nihui
6869c81ed3
find cpu cache size from sysfs ( #4502 )
* find cpu cache size from sysfs
* android l3
* make g_thread_affinity_mask singleton
* global mask
3 years ago
nihui
17197b3c45
ci build with musl libc ( #4499 )
3 years ago
nihui
ce6b80a16b
pnnx flatten input tuple list ( #4498 )
3 years ago
向阳
6af0f3777f
Remove redundant include ( #4497 )
3 years ago
nihui
3b36656bc8
reduce vulkan winograd f43 transform shader register pressure ( #4496 )
3 years ago
nihui
dfbcd3e69b
improve vulkan winograd f43 fp16 numerical stability ( #4492 )
3 years ago
weirdseed
503a8b921f
fix uninitialized gpu bug_buffer_image_load_zero value ( #4493 )
3 years ago
nihui
d2d012dce5
x86 bfloat16 cast functions ( #4491 )
* simplify cast fp16 avx512 dispatch
* define sse4.1 macro on msvc avx+
3 years ago
dependabot[bot]
f046181766
Bump pypa/cibuildwheel from 2.11.4 to 2.12.0 ( #4477 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.11.4 to 2.12.0.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.11.4...v2.12.0 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
Hitesh Kumar
add0a7bac4
fix : minor typo readme ( #4486 )
3 years ago
nihui
fed99fd35b
gemm output transpose, prepack c ( #4479 )
* mha is now permute and reshape free
* gemm user defined tile mnk param
3 years ago
li mengyang
ff4d05a713
fix typo ( #4482 )
3 years ago
nihui
2e3e680d77
x86 optimization for packed convolution unified elempack ( #4469 )
3 years ago
nihui
bd5bbe3f2c
x86 optimization for winograd unified elempack part2 ( #4470 )
* improve gemm packb threading
* optimize tile size
* profile winograd condition
* handle threads changes
3 years ago
ws
643285a08c
fix macos vulkan instance create failed when vulkan sdk version >= 1.… ( #4472 )
* enable VK_KHR_portability_subset extension if device support it
Co-authored-by: w1ndseeker <w1ndseeker@users.noreply.github.com>
3 years ago
nihui
88274827da
x86 optimization for winograd unified elempack ( #4456 )
3 years ago
nihui
f0a91f46f5
update macos ci xcode version and vulkan sdk ( #4465 )
3 years ago
WuJinxuan
ad956c8c9c
[ARM] GELU ( #4464 )
3 years ago
wyushun
68de8a2128
fix output_indexes name ( #4453 )
3 years ago
Jiao Dian's Power Plant
b07c5fc811
Remove unused imports in python ( #4378 )
3 years ago
WuJinxuan
10e9d91576
Add x86 MultiHeadAttention ( #4443 )
* fix doc, sync x86 gemm fix
Co-authored-by: EdVince <EdVince@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
nihui
15761fc1a6
arm vfpv4 asimdhp asimdfhm optimization for gemm ( #4432 )
3 years ago
nihui
c471826da1
fix arm bfloat2float float2bfloat oops ( #4439 )
3 years ago
dependabot[bot]
b5884827d6
Bump pypa/cibuildwheel from 2.11.3 to 2.11.4 ( #4438 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.11.3 to 2.11.4.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.11.3...v2.11.4 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
nihui
88dba58992
fix gemm transpose B wrong result when tile N is not a multiple of 4, optimize load C ( #4430 )
3 years ago
Yoh
0fffefd3c9
fix crop bug and eliminate Tensor.clone ( #4416 )
* fix crop bug and eliminate Tensor.clone
* fix crop slice bug on msvc
* fix slice bug on msvc
3 years ago
nihui
7b3261dace
gemm arm optimization ( #4426 )
* cmake determine target 32bit and 64bit
* include opt source with non-runtime cpu
* check compiler support gnu style inline assembly
3 years ago
inisis
62fc16d157
pnnx readme remove duplicate space ( #4428 )
3 years ago
Fangjun Kuang
607c8f8332
Update README to include sherpa-ncnn for real-time speech recognition ( #4424 )
3 years ago
mizu-bai
c4574586ca
Add Example ncnn-fortran ( #4423 )
3 years ago