dependabot[bot]
8b68ec050d
Bump actions/checkout from 2 to 3 ( #3588 )
Bumps [actions/checkout](https://github.com/actions/checkout ) from 2 to 3.
- [Release notes](https://github.com/actions/checkout/releases )
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/checkout/compare/v2...v3 )
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago
nihui
2880eff264
deconv1d deconv3d ( #3584 )
* fix sigmoid returns nan with very large input
4 years ago
dependabot[bot]
556aeb675b
Bump actions/setup-python from 2 to 3 ( #3586 )
Bumps [actions/setup-python](https://github.com/actions/setup-python ) from 2 to 3.
- [Release notes](https://github.com/actions/setup-python/releases )
- [Commits](https://github.com/actions/setup-python/compare/v2...v3 )
---
updated-dependencies:
- dependency-name: actions/setup-python
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
4 years ago
Yoh
f5b90a5333
Add unaryop x86 ( #3579 )
Co-authored-by: Yoh <wangpeizhou@bilibili.com>
Co-authored-by: Yoh-Z <Yoh-Z@users.noreply.github.com>
4 years ago
nihuini
bc188ece58
update modelwriter for new operators
4 years ago
nihui
457e066eb5
x86 f16c infrastructure ( #3577 )
4 years ago
nihui
38ae671391
add yolov5 pnnx example
4 years ago
Yoh
4b68e3f9c1
Opt avxmath ( #3563 )
* optimize x86 avx exp log sincos fma
* optimize avx&sse_mathfun exp,log,cos,sin,sincos and add fnmadd to x86_usability
Co-authored-by: Yoh-Z <Yoh-Z@users.noreply.github.com>
4 years ago
nihui
920aa79f04
drop x86 avx2 fp16 ( #3568 )
4 years ago
nihui
6b2495cc24
add reshape before and after pooling 123d with no batch dimension ( #3566 )
4 years ago
nihui
76e32e9ee6
fix interp nearest by scale factor, fix issue #3555 ( #3565 )
* lets accept the float div vs recp error
4 years ago
nihuini
6ee5ab72f6
handle reshape 5d with batch index 0
4 years ago
tpoisonooo
cfacba273f
improvement(binaryop): use MAKE_FUNCTION macro ( #3559 )
Co-authored-by: MegEngine <megengine@megvii.com>
Co-authored-by: tpoisonooo <tpoisonooo@users.noreply.github.com>
4 years ago
nihui
68d0c8bccb
apply code-format changes
4 years ago
nihui
fb5d5c011a
convert F.pad with negative pads, eliminate noop expression
4 years ago
nihui
d452eca28f
convert torch.matmul, eliminate noop pad and identity op, fuse transpose matmul, fuse select to unbind ( #3554 )
4 years ago
nihui
b352221ba3
handle pnnx half weight when converting to ncnn
4 years ago
nihui
4654030541
decouple x86 fma avx2 ( #3560 )
4 years ago
nihui
e1a8c87bc8
wrap namespace for function objects ( #3558 )
4 years ago
nihui
3a5bb6c315
update ci swiftshader 20220211 ( #3557 )
4 years ago
nihuini
69737ad169
fix permute vulkan
4 years ago
nihuini
d6df25c73f
fix avx512 ci
4 years ago
nihuini
57a7101fc6
fix ci, second try
4 years ago
nihuini
cfedcfdc57
fix ci, first try
4 years ago
Yuzhong Yan
681141ff42
[YZ] Fix bug in unit test ( #3556 )
4 years ago
Evgeny Proydakov
6f824c57a1
Removed travis CI from sourcetree & readme. ( #3488 )
4 years ago
nihui
aa89529839
disable lto in ci build for oom issue ( #3552 )
4 years ago
nihui
d276ecd401
fix avx512 illegal instruction ( #3550 )
4 years ago
nihuini
51ecc33d9d
check avx512vl extension for discarding old-slow avx512 chips, enable avx512 option by default
4 years ago
nihui
3f2799d706
always build tightly packed weight, fix #3545 ( #3547 )
4 years ago
nihui
a46edcf720
x86 optimization for interp ( #3546 )
4 years ago
nihui
139554b36e
rewrite convolution x86 sgemm pack1 ( #3544 )
4 years ago
Yoh
d2999b8d53
Optimize scale x86 ( #3540 )
Co-authored-by: Yoh-Z <Yoh-Z@users.noreply.github.com>
4 years ago
nihui
fb6283c8b0
x86 avx fma optimization ( #3543 )
4 years ago
nihui
3a43cc7015
update efficientnetv2_b0 param for reduction axes changes
4 years ago
nihui
3181616439
treat old reduction axes param as failure
4 years ago
nihui
672daa7e04
xop infrastructure and optimization ( #3541 )
4 years ago
nihui
9d0c36358c
add z8350 and n5105 benchmark
4 years ago
nihui
de77b669c4
x86 sse2 optimization for conv1x1/3x3 pack4 and general sgemm pack4/pack4to1 ( #3538 )
* x86 sse2 optimization for conv1x1 conv3x3 pack4 and general sgemm pack4/pack4to1
* x86 sse2 optimization for conv3x3s1 pack4to1 and general sgemm convolution pack4to1, use aligned load/store
* enforce explicit alignment
4 years ago
nihui
6422e6acd3
fix x86 sgemm convolution int8 weight shuffle
4 years ago
nihui
340b4e673e
pnnx fold constant ( #3521 )
4 years ago
Kagurazaka Kotori
08ecc94d63
x86: Use _mm_cvtsi128_si{32,64} in float2int8 ( #3536 )
This patch uses _mm_cvtsi128_si{32,64} intrinsics when returning value
in float2int8() to reduce unnecessary memory accesses.
Resolves TODO "use _mm_cvtsi128_si64 on 64bit target".
Signed-off-by: Kagurazaka Kotori <kagurazakakotori@gmail.com>
4 years ago
nihui
1d0b78f9b6
Update README.md
4 years ago
nihui
a356d152bb
Update README.md
4 years ago
Joson
70795c6548
Create README.md ( #3532 )
4 years ago
teng
3ff9ae707f
simplify macro ( #3530 )
4 years ago
Kagurazaka Kotori
5c078016c2
x86/avx_mathfun.h: Remove fallback warnings ( #3527 )
* x86/avx_mathfun.h: Remove fallback warnings
This patch removes warning messages indicating falling back to SSE2
when AVX2 support is disabled as suggested. Also reorders non-AVX2
macros for readability and faster preprocessing.
Suggested-by: nihui <shuizhuyuanluo@126.com>
Signed-off-by: Kagurazaka Kotori <kagurazakakotori@gmail.com>
* apply code-format changes
Co-authored-by: kagurazakakotori <kagurazakakotori@users.noreply.github.com>
4 years ago
nihui
2d46994d2e
wrap avxvnni and avx512vnni build options over cpu feature detector
4 years ago
nihui
33e225f173
fix c api test
4 years ago
nihui
bae2ee375f
simplify c api layer forward_n output array type
4 years ago