Zhuo Zhang
bfe27f2aed
Ignore .xmake directory ( #4212 )
3 years ago
Lry89757
5eb56b2ea5
[Gelu x86] Finish intrinsic with elempack merged(fast version) ( #4144 )
* Finish the gelu x86 intrinsics
* Finish the fast tanh x86 simd impl
3 years ago
magicse
6167b729c5
changed size of images for pretty formatting of page ( #4193 )
3 years ago
Lry89757
9f59711338
[Prelu x86] Finish intrinsic with elempack merged ( #4177 )
3 years ago
Lry89757
b16f8ca921
[docs] Fix typo ( #4201 )
3 years ago
luqiang guo
5148224516
optmize softmax arm neon ( #4171 )
3 years ago
Menci
479a73a62a
remove duplicated newline ( #4188 )
3 years ago
Molly Sophia
1d7b2172cc
remove duplicated newline ( #4187 )
3 years ago
Lry89757
9278f90114
[Elu x86] Finish intrinsic with elempack merged ( #4153 )
3 years ago
Kenji Mouri
964be3b536
Add benchmark result of Intel Celeron M 420. ( #4141 )
3 years ago
nanjoin
3c0096c548
fix ConvolutionDepthwise allocator not updated ( #4173 )
3 years ago
dependabot[bot]
9af349cad8
Bump pypa/cibuildwheel from 2.8.1 to 2.9.0 ( #4134 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.8.1 to 2.9.0.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.8.1...v2.9.0 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
tpoisonooo
acbaaa665b
fix compile warnings for unused parameter ( #4131 )
3 years ago
septs
0b3d157a62
refactor: README ( #4114 )
3 years ago
Lry89757
00c08d7bda
[Batchnorm x86] Merge the multiple elempack ( #4085 )
Co-authored-by: LRY89757 <LRY89757@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
LinHe
03f2ad38ce
Layer Norm x86 SIMD Optimizations ( #4065 )
Co-authored-by: LinHeLurking <LinHeLurking@users.noreply.github.com>
Co-authored-by: LRY89757 <LRY89757@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
3 years ago
nihui
b4ba207c18
more strict compiler rvv checks, drop rvv-071 support ( #4094 )
3 years ago
nihui
0666143513
fix vulkan winograd weight layout with cooperative matrix enabled ( #4093 )
3 years ago
miemie2013
720f3c9aab
Add DeformableConv2D ( #4070 )
* Add DeformableConv2D
* add unittest and docs
* pnnx torchvision deformconv2d conversion
Co-authored-by: miemie2013 <miemie2013@users.noreply.github.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
3 years ago
nihui
d5f0d89313
migrate ci android ndk-bundle to ANDROID_NDK_LATEST_HOME env ( #4087 )
* migrate ci android ndk-bundle to ANDROID_NDK_LATEST_HOME env
* fix android noneon ci
3 years ago
dependabot[bot]
c356b4d93e
Bump pypa/gh-action-pypi-publish from 1.5.0 to 1.5.1 ( #4086 )
Bumps [pypa/gh-action-pypi-publish](https://github.com/pypa/gh-action-pypi-publish ) from 1.5.0 to 1.5.1.
- [Release notes](https://github.com/pypa/gh-action-pypi-publish/releases )
- [Commits](https://github.com/pypa/gh-action-pypi-publish/compare/v1.5.0...v1.5.1 )
---
updated-dependencies:
- dependency-name: pypa/gh-action-pypi-publish
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
Zhouzhou
4158e63668
docs:add sse optimized zh ( #4053 )
Signed-off-by: Zhouzhou <1197236910@qq.com>
3 years ago
nihui
402001128d
eliminate math expression with torch.zeros/ones input ( #4083 )
3 years ago
nihui
4f414c1806
implement 4d memorydata ( #4074 )
* implement 4d memorydata
* fix ncnnoptimize memorydata 4d
3 years ago
Lry89757
13a9533984
[BatchNorm Optimize x86] AVX512 intrinsic ( #4061 )
* Add the test samples for elempack==16
* Add the AVX512 Support for batchnorm
3 years ago
nihuini
e33c85ca82
fix pnnx crash on dynamic pads, do not eliminate upsample for dynamic shape
3 years ago
nihui
30ab31cc41
add address sanitizer ci, fix potential memory leak shouted by asan ( #4058 )
3 years ago
nihui
0ea7a672fa
fix undefined reference to vkGetAndroidHardwareBufferPropertiesANDROID, add android-29 shared ci ( #4056 )
3 years ago
nihui
4bc4a5ed0b
check mat create oom ( #4054 )
3 years ago
nihui
1d0917c83b
fix build with very old gcc ( #4048 )
* clear bom marker, avoid vector data function
3 years ago
nihui
e64245c44a
ci x86 no sse, do not force sse2 for x86 32bit toolchain ( #4043 )
3 years ago
dependabot[bot]
4b118bffa4
Bump pypa/cibuildwheel from 2.8.0 to 2.8.1 ( #4045 )
Bumps [pypa/cibuildwheel](https://github.com/pypa/cibuildwheel ) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/pypa/cibuildwheel/releases )
- [Changelog](https://github.com/pypa/cibuildwheel/blob/main/docs/changelog.md )
- [Commits](https://github.com/pypa/cibuildwheel/compare/v2.8.0...v2.8.1 )
---
updated-dependencies:
- dependency-name: pypa/cibuildwheel
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
3 years ago
苏业钦
b253f8c9b3
loongarch target -> mips: ( #4044 )
3 years ago
nihui
b0c40fa644
unified arm eltwise elempack ( #4040 )
3 years ago
ArchieMeng
40a8d31d9b
Add svg file for ncnn logo ( #4037 )
3 years ago
Xiang Shin Wuu
2b298e8fef
example for ncnn-android-yolov7 added ( #4038 )
3 years ago
nihui
76849cede4
armv8.4 i8mm optimization for convolution gemm int8 ( #4034 )
3 years ago
Jie Li
627be612c6
fix(examples): optimize nms_sorted_bboxes ( #4030 )
3 years ago
nihui
dd86cebab8
armv8.6 ci and coverage ( #4025 )
* asimdfhm in fc
* move neon bf16 conversion function to arm_usability header
* fix cmake option
* fix build with newer gcc
* arm84 coverage
* arm asimdfhm optimization for innerproduct gemm fp16s
3 years ago
nihui
c5bb0e52ed
add ingenic-x2000 toolchain file
3 years ago
nihui
9b39691cc8
pnnx handle unrecognized file format ( #4028 )
3 years ago
nihui
f1ea792b26
fix too many microtask error in old libomp runtime ( #4002 )
3 years ago
Guo Haria
c1e2ab7205
add yolov7_pnnx example ( #4027 )
3 years ago
xuehao.ma
962a49069a
add the param file of fastestdet in benchmark ( #4026 )
3 years ago
teng
3901b837e2
add example yolov7 ( #4019 )
3 years ago
nihui
9b8272e86d
arm edsp and arm neon optimization for convolution int8 winograd ( #4017 )
3 years ago
nihui
a12cd7c212
mips msa and loongson mmi optimization for convolution int8 winograd f43 ( #4014 )
3 years ago
陸 言
cae8d0f1d7
Add Loongson 2F toolchain support (refer to AOSC) ( #3992 )
3 years ago
nihui
5725c028c0
arm dsp infrastructure and optimization for convolution gemm int8 ( #4011 )
3 years ago
nihui
ef216f732e
armv5 optimization for convolution gemm int8 ( #4010 )
3 years ago