dependabot[bot]
4e6b5d9779
Bump github/codeql-action from 1 to 2 ( #3737 )
Bumps [github/codeql-action](https://github.com/github/codeql-action ) from 1 to 2.
- [Release notes](https://github.com/github/codeql-action/releases )
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md )
- [Commits](https://github.com/github/codeql-action/compare/v1...v2 )
---
updated-dependencies:
- dependency-name: github/codeql-action
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago
BUG1989
30dec7ae28
Update the benchmark of jetson agx xavier ( #3735 )
4 years ago
nihui
57f3017afd
pnnx torch bmm max min ( #3736 )
* convert torch.bmm
* convert torch.min torch.max
4 years ago
BUG1989
f14388a150
Add the benchmark of Jetson AGX Orin ( #3733 )
4 years ago
FeiGeChuanShu
3f6fbcc7a2
add auto num_class for yolov5_pnnx example ( #3729 )
4 years ago
Yoh
bf1109c10b
fix elempack16 bug ( #3728 )
4 years ago
Xavier Hsinyuan
cea2aff3bb
RVV Optimized erfc and GELU ( #3726 )
* RVV optimized GELU(fp32) and erfc(fp32)
4 years ago
jasonZhang
98e35ded36
arm neon optimization for arcsin/arccos pack4 ( #3725 )
4 years ago
nihui
6125c9f47c
add tips for disabling android ui and setting cpu/gpu performance mode
4 years ago
nihui
b4a31638f9
update rk3399 rk3288 gpu benchmark data
4 years ago
jasonZhang
8e2e4db3f4
arm neon optimization for tan pack4 ( #3722 )
Co-authored-by: jasonZhang892 <jasonZhang892@users.noreply.github.com>
4 years ago
nihui
308965b7e9
sanitize cooperative matrix option in tests
4 years ago
nihui
50fa6d39c0
enable fp16a for mali t760 v2
4 years ago
nihui
7600270430
create uop in spirv-1 mode for vulkan 1.0 compatibility ( #3721 )
4 years ago
nihui
a061871c1c
use getauxval since android api 18 ( #3718 )
4 years ago
UNeedCryDear
f3129be69a
Reduce yolov5 post-processing time. ( #3649 )
* Reduce yolov5 post time.
sigmoid(X)<1,
IF:box_confidence<prob_threshold
THEN: confidence=box_confidence*sigmoid(class_score)<box_confidence<prob_threshold,
So there is no need to calculate class_score when box_confidence<prob_threshold,and the confidence of most anchors is lower than the prob_threshold.
Co-authored-by: UNeedCryDear <UNeedCryDear@users.noreply.github.com>
4 years ago
nihui
ae75a093fa
update print 4d mat, remove deprecated content
4 years ago
nihui
88da92f9bf
fix pnnx macos install rpath
4 years ago
nihui
a14acbc6dd
pnnx save ncnn bin with fp16 storage ( #3715 )
4 years ago
nihui
0623f94eca
arm neon optimization for softmax ( #3713 )
4 years ago
nihui
0ea327b557
x86 sse/avx/avx512 optimization for softmax ( #3712 )
4 years ago
nihui
9826f3dbf8
shader include vulkan activation, workaround for moltenvk tanh half4 issue ( #3711 )
4 years ago
jasonZhang
2b0064c3db
fix armv7 platform build tests error ( #3710 )
4 years ago
nihui
49e70c81a6
update linking glslang libraries
4 years ago
nihui
376f14bba4
windows ci matrix ( #3709 )
4 years ago
nihui
eadb2381dd
force fp32 accumulator for global pooling ( #3708 )
4 years ago
nihui
715a248aab
reduce armv7 inline assembly gpr constraint count, fix armhf build ( #3707 )
4 years ago
nihui
a0b9e8c143
fix build x86 avx512 source with old gcc ( #3705 )
4 years ago
nihui
06fadd374f
fix pnnx hardtanh ncnn conversion
4 years ago
tpoisonooo
5b1bd0f78a
style(src/layer/x86): fix build warning ( #3699 )
4 years ago
nihui
a03c135319
enable policy cmp0091 before project pnnx
4 years ago
nihui
f945c5cfbc
update ls2k1000 benchmark data
4 years ago
nihui
b051b6d8cd
apply code-format changes
4 years ago
nihui
c0822fa3b4
sanitize batchnorm divide by zero error, mips msa optimization for batchnorm
4 years ago
nihui
090ac68de5
use msa no signal compare
4 years ago
nihui
337bed7029
copy_make_border border reflect type
4 years ago
nihui
0f2f25f0af
pnnx static msvc crt option
4 years ago
nihui
2a00a74c1d
add loongarch ci build status
4 years ago
nihui
5b7268d95f
loongarch64 ci ( #3455 )
4 years ago
nihui
131f3d1323
x86 avx512 optimization for convolution winograd pack16to1 and deconvolution family, increase simpleomp argv count ( #3694 )
* convolution winograd pack16to1
* x86 deconvolution and deconvolutiondepthwise
* simpleomp allow 32 arguments
* drop shadow variable workaround
* less winograd test error
4 years ago
_0Mirror
2dcd85ca71
docs: fix docs about 'Build for iOS on macOS with xcode' ( #3696 )
4 years ago
nihui
3d169b3237
x86 avx512 optimization ( #3691 )
* convolution sgemm pack16to1
* convolution sgemm pack4to16
* eltwise avx512
4 years ago
nihui
9298d05e86
split convolution winograd transform input output ( #3688 )
4 years ago
nihui
32560f47de
detect more baseline avx512 flags ( #3687 )
4 years ago
dependabot[bot]
a0621487ac
Bump actions/download-artifact from 2 to 3 ( #3686 )
Bumps [actions/download-artifact](https://github.com/actions/download-artifact ) from 2 to 3.
- [Release notes](https://github.com/actions/download-artifact/releases )
- [Commits](https://github.com/actions/download-artifact/compare/v2...v3 )
---
updated-dependencies:
- dependency-name: actions/download-artifact
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago
dependabot[bot]
c7808d2c6a
Bump actions/upload-artifact from 2 to 3 ( #3685 )
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact ) from 2 to 3.
- [Release notes](https://github.com/actions/upload-artifact/releases )
- [Commits](https://github.com/actions/upload-artifact/compare/v2...v3 )
---
updated-dependencies:
- dependency-name: actions/upload-artifact
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago
nihui
dadc640c66
x86 avx512 optimization ( #3581 )
* unified relu avx512
* unifed clip avx512
* unaryop avx512
* sigmoid avx512
* binaryop avx512
* padding convolution avx512
* convolutiondepthwise avx512
* innerproduct avx512
* reshape avx512
* slice avx512
* hardsigmoid hardswish avx512
* swish avx512
* pooling avx512
* crop avx512
* convolution sgemm pack16
* convolution 3x3 winograd pack16
* interp avx512
* convolution sgemm pack1to16
* convolution sgemm pack16to8
* convolution sgemm pack8to16
* convolution sgemm pack16to4
* fix vulkan permute pack8
* fix vulkan convolution gemm pack8to1
4 years ago
nihui
462b80052f
define NOMINMAX for pnnx windows build
4 years ago
nihui
a9c59bb93c
add -mavx512bw flag for avx512 build ( #3671 )
4 years ago
dependabot[bot]
97f0fbea01
Bump codecov/codecov-action from 2.1.0 to 3 ( #3680 )
Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action ) from 2.1.0 to 3.
- [Release notes](https://github.com/codecov/codecov-action/releases )
- [Changelog](https://github.com/codecov/codecov-action/blob/master/CHANGELOG.md )
- [Commits](https://github.com/codecov/codecov-action/compare/v2.1.0...v3 )
---
updated-dependencies:
- dependency-name: codecov/codecov-action
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago