nihui
3e2b3fa04d
more stricter armv7 fp16 and armv84 bf16 compiler check, fix #4147 fix #4222 ( #4247 )
3 years ago
nihui
cef95f64a4
pnnx math operation ( #4251 )
3 years ago
MisakaBit
bbbe17c5b5
docs: disable fp16 when wrong results encountered caused by overflow ( #4248 )
3 years ago
LinHe
9426e21166
Memory Pool Improvement For Variadic Sized Inputs ( #4190 )
* Simple miss count for better space efficiency
* Simple double ended greedy;
* Add size drop threshold setter;
* set workspace allocator cr to zero as we had some sort of recylcing capability :P
Co-authored-by: LinHeLurking <LinHeLurking@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
Zhang Ge
3fce00b260
fix param parsing issue when layer/blob name exceeds 255 ( #4236 )
* fix param parsing issue when layer/blob name exceeds 255
* apply code-format changes
Co-authored-by: ZhangGe6 <ZhangGe6@users.noreply.github.com>
3 years ago
柚木鉉
eb9bb5d191
Add benchmark result of T-Head TH1520 ( #4240 )
`cpuinfo`:
```
isa : rv64imafdcvsu
mmu : sv39
cpu-freq : 1.848Ghz
cpu-icache : 64KB
cpu-dcache : 64KB
cpu-l2cache : 1MB
cpu-tlb : 1024 4-ways
cpu-cacheline : 64Bytes
cpu-vector : 0.7.1
```
Compiled with `-DCMAKE_TOOLCHAIN_FILE=../toolchains/c910-v240.toolchain.cmake -DCMAKE_BUILD_TYPE=release -DNCNN_OPENMP=OFF -DNCNN_THREADS=OFF -DNCNN_RUNTIME_CPU=OFF -DNCNN_RVV=ON -DNCNN_SIMPLEOCV=ON -DNCNN_BUILD_EXAMPLES=ON`
Seems much worse than expected 🤔
3 years ago
nihui
59a6fa37e2
add c906 build ci ( #4232 )
3 years ago
dependabot[bot]
bdcd68f574
Bump pypa/cibuildwheel from 2.10.1 to 2.10.2 ( #4220 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.10.1 to 2.10.2.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.10.1...v2.10.2 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
Xavier Hsinyuan
e7eadca6c1
RVV: use new interface for segment load/store & change word_type to size_t&add clang ci (part #4100 ) ( #4118 )
* RVV: use size_t for vl
* RVV: replace vsseg.v tuple type by using regex
-----
search:
vsseg([1-9])e(8|16|32)_v_(f|i|u)\2m(1|2|4|8)x\1\(([ -~]+), vcreate_\3\2m\4x\1\(([ -~]+)\), vl\);
substitute by:
vsseg$1e$2_v_$3$2m$4($5, $6, vl);
* RVV: replace vssseg.v tuple types by using regex
---
search:
vssseg([1-9])e(8|16|32)_v_f\2m1x\1\(([ -~]+), vcreate_f\2m1x\1\(([ -~]+)\), vl\);
substitute by:
vssseg$1e$2_v_f$2m1($3, $4, vl);
* RVV: replace vlseg.v tuple types in load/store
* RVV: replace vloxseg2ei32.v tuple types
* RVV: add a wrapper for old compilers
* RVV: add segment load/store wrapper in pakcing
* RVV: fix cmake test
* RVV: make clang happy by dropping VLAs in sgemm
* RVV: add clang cmake toolchain configure
* RVV: add clang ci, riscv64-unknown-linux-gnu
Co-authored-by: thelastlin <thelastlin@users.noreply.github.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
3 years ago
Zhuo Zhang
4f9e3984a5
Ignore CMakeSettings.json, the Visual Studio CMake schema file ( #4228 )
3 years ago
汤圆奶昔
d30fc825d4
style: space alignment ( #4217 )
3 years ago
dependabot[bot]
183e6e99b9
Bump pypa/cibuildwheel from 2.9.0 to 2.10.1 ( #4207 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.9.0 to 2.10.1.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.9.0...v2.10.1 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
Zhuo Zhang
bfe27f2aed
Ignore .xmake directory ( #4212 )
3 years ago
Lry89757
5eb56b2ea5
[Gelu x86] Finish intrinsic with elempack merged(fast version) ( #4144 )
* Finish the gelu x86 intrinsics
* Finish the fast tanh x86 simd impl
3 years ago
magicse
6167b729c5
changed size of images for pretty formatting of page ( #4193 )
3 years ago
Lry89757
9f59711338
[Prelu x86] Finish intrinsic with elempack merged ( #4177 )
3 years ago
Lry89757
b16f8ca921
[docs] Fix typo ( #4201 )
3 years ago
luqiang guo
5148224516
optmize softmax arm neon ( #4171 )
3 years ago
Menci
479a73a62a
remove duplicated newline ( #4188 )
3 years ago
Molly Sophia
1d7b2172cc
remove duplicated newline ( #4187 )
3 years ago
Lry89757
9278f90114
[Elu x86] Finish intrinsic with elempack merged ( #4153 )
3 years ago
Kenji Mouri
964be3b536
Add benchmark result of Intel Celeron M 420. ( #4141 )
3 years ago
nanjoin
3c0096c548
fix ConvolutionDepthwise allocator not updated ( #4173 )
3 years ago
dependabot[bot]
9af349cad8
Bump pypa/cibuildwheel from 2.8.1 to 2.9.0 ( #4134 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.8.1 to 2.9.0.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.8.1...v2.9.0 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
tpoisonooo
acbaaa665b
fix compile warnings for unused parameter ( #4131 )
3 years ago
septs
0b3d157a62
refactor: README ( #4114 )
3 years ago
Lry89757
00c08d7bda
[Batchnorm x86] Merge the multiple elempack ( #4085 )
Co-authored-by: LRY89757 <LRY89757@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
LinHe
03f2ad38ce
Layer Norm x86 SIMD Optimizations ( #4065 )
Co-authored-by: LinHeLurking <LinHeLurking@users.noreply.github.com>
Co-authored-by: LRY89757 <LRY89757@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
nihui
b4ba207c18
more strict compiler rvv checks, drop rvv-071 support ( #4094 )
3 years ago
nihui
0666143513
fix vulkan winograd weight layout with cooperative matrix enabled ( #4093 )
3 years ago
miemie2013
720f3c9aab
Add DeformableConv2D ( #4070 )
* Add DeformableConv2D
* add unittest and docs
* pnnx torchvision deformconv2d conversion
Co-authored-by: miemie2013 <miemie2013@users.noreply.github.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
3 years ago
nihui
d5f0d89313
migrate ci android ndk-bundle to ANDROID_NDK_LATEST_HOME env ( #4087 )
* migrate ci android ndk-bundle to ANDROID_NDK_LATEST_HOME env
* fix android noneon ci
3 years ago
dependabot[bot]
c356b4d93e
Bump pypa/gh-action-pypi-publish from 1.5.0 to 1.5.1 ( #4086 )
Bumps [pypa/gh-action-pypi-publish](https://github.com/pypa/gh-action-pypi-publish ) from 1.5.0 to 1.5.1.
- [Release notes](https://github.com/pypa/gh-action-pypi-publish/releases )
- [Commits](https://github.com/pypa/gh-action-pypi-publish/compare/v1.5.0...v1.5.1 )
---
updated-dependencies:
- dependency-name: pypa/gh-action-pypi-publish
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
Zhouzhou
4158e63668
docs:add sse optimized zh ( #4053 )
Signed-off-by: Zhouzhou <1197236910@qq.com>
3 years ago
nihui
402001128d
eliminate math expression with torch.zeros/ones input ( #4083 )
3 years ago
nihui
4f414c1806
implement 4d memorydata ( #4074 )
* implement 4d memorydata
* fix ncnnoptimize memorydata 4d
3 years ago
Lry89757
13a9533984
[BatchNorm Optimize x86] AVX512 intrinsic ( #4061 )
* Add the test samples for elempack==16
* Add the AVX512 Support for batchnorm
3 years ago
nihuini
e33c85ca82
fix pnnx crash on dynamic pads, do not eliminate upsample for dynamic shape
3 years ago
nihui
30ab31cc41
add address sanitizer ci, fix potential memory leak shouted by asan ( #4058 )
3 years ago
nihui
0ea7a672fa
fix undefined reference to vkGetAndroidHardwareBufferPropertiesANDROID, add android-29 shared ci ( #4056 )
3 years ago
nihui
4bc4a5ed0b
check mat create oom ( #4054 )
3 years ago
nihui
1d0917c83b
fix build with very old gcc ( #4048 )
* clear bom marker, avoid vector data function
3 years ago
nihui
e64245c44a
ci x86 no sse, do not force sse2 for x86 32bit toolchain ( #4043 )
3 years ago
dependabot[bot]
4b118bffa4
Bump pypa/cibuildwheel from 2.8.0 to 2.8.1 ( #4045 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.8.0...v2.8.1 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
苏业钦
b253f8c9b3
loongarch target -> mips: ( #4044 )
3 years ago
nihui
b0c40fa644
unified arm eltwise elempack ( #4040 )
3 years ago
ArchieMeng
40a8d31d9b
Add svg file for ncnn logo ( #4037 )
3 years ago
Xiang Shin Wuu
2b298e8fef
example for ncnn-android-yolov7 added ( #4038 )
3 years ago
nihui
76849cede4
armv8.4 i8mm optimization for convolution gemm int8 ( #4034 )
3 years ago
Jie Li
627be612c6
fix(examples): optimize nms_sorted_bboxes ( #4030 )
3 years ago