Han Gao/Revy/Rabenda
3fcb8a7263
Add 2K2000 benchmark ( #5214 )
2 years ago
Zhenyu ZHAO
ef29bbec49
Python pnnx with pnnx binary ( #5067 )
2 years ago
nihui
5a8ce63af4
optimize resize bilinear and compress font data ( #5200 )
2 years ago
FantasyGmm
3cff53c30a
add Jetson TX2 benchmark ( #5213 )
2 years ago
nihui
eea3fc9b41
optimize vulkan global pooling ( #5191 )
Co-authored-by: nihui <nihui@users.noreply.github.com>
Co-authored-by: michaelcai <michaelcai@tencent.com>
2 years ago
nihui
1138312f1e
detect avx512 isa with signal action on macos ( #5185 )
2 years ago
nihui
dba87f8cad
fix build with msvc arm64 asimdhp ( #5176 )
2 years ago
nihui
8b38cd8e0b
fix linux sde ci ( #5184 )
2 years ago
JeremyRand
c1d952da7e
Fix "Rasberry" typo ( #5182 )
Co-authored-by: Jeremy Rand <jeremyrand@danwin1210.de>
2 years ago
mahiru
5c432d5985
add the benchmark of 8gen2 ( #5180 )
2 years ago
JeremyRand
765ac7aef6
Update Vulkan dependency docs ( #5178 )
Vulkan deps are optional. vulkan-utils is replaced with vulkan-tools
since Debian 10.
Co-authored-by: Jeremy Rand <jeremyrand@danwin1210.de>
2 years ago
nihui
3785921e01
pnnx convert nn.Upsample with recompute_scale_factor=True ( #5169 )
2 years ago
dependabot[bot]
8fd5a1fa4c
Bump actions/checkout from 3 to 4 ( #5171 )
Bumps [actions/checkout](https://github.com/actions/checkout ) from 3 to 4.
- [Release notes](https://github.com/actions/checkout/releases )
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/checkout/compare/v3...v4 )
---
updated-dependencies:
- dependency-name: actions/checkout
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2 years ago
JeremyRand
d1f6193250
Update POWER Clang version docs ( #5174 )
Clang prior to 13 no longer fails to build ncnn since #4845 .
Clang 18 fixes SSE4.1 translation, which yields a major speedup.
Co-authored-by: Jeremy Rand <jeremyrand@danwin1210.de>
2 years ago
nihui
deae9e61da
disable rtti and exceptions for msvc ( #5167 )
* disable rtti and exceptions for msvc
* warnings--
* erff
* arch sse2 for 32bit build
* enable rtti for cross compiling
2 years ago
nihui
5cc1307f72
add woa linux ci ( #5162 )
2 years ago
nihui
058aa0ad37
enable arm neon intrinsics for msvc build ( #5151 )
2 years ago
FantasyGmm
a029587725
Update RPi 5 GPU Benchmark Result ( #5160 )
use mesa-23.3-rc4 driver
2 years ago
FantasyGmm
22c990ac65
Add Raspberry Pi 5 GPU Benchmark ( #5152 )
2 years ago
AlOa
9f26eeb5a7
Prelu layer uses sse instruction _mm_load_ps but data can be misaligned so it must use _mm_loadu_ps ( #5149 )
2 years ago
Justin Fung
465debe9bb
Add print statements for 4 dimensions benchmark ( #5148 )
2 years ago
nihui
4136de3b8d
arm optimization for convolution int8 packed unified elempack ( #5147 )
2 years ago
Zhuo Zhang
54e58bfaf5
Improve cmake toolchain files ( #5144 )
* [style] Use same style for cmake `set` command
* [build] Specify CMAKE_FIND_ROOT_PATH_MODE_PACKAGE in xxx.toolchain.cmake
* [build] Allow user passing args for cmake find cmds
2 years ago
nihui
a8c47bce33
fix python ncnn vulkan build ( #5143 )
2 years ago
張小凡
2ecaf37a3e
Fix find GPU driver dll path in windows ( #5141 )
2 years ago
nihui
b4f26237cb
in-house vulkan loader ( #5130 )
* vulkan-driver-loader.md
* static vulkan on apple
2 years ago
ningjiang233
b2f12fdd67
delete useless setences ( #5139 )
2 years ago
nihui
31e315981a
watchos and tvos ci ( #5124 )
2 years ago
nihui
39bc71c941
support big endian platform, add powerpc ci ( #5121 )
2 years ago
nihui
4494aadd74
deconvolution dynamic weight ( #5119 )
2 years ago
mizu-bai
c96bc08307
Add OneCloud Amlogic S805 ( #5120 )
2 years ago
Justin Fung
009f5eae97
Add description of build for Nintendo 3DS homebrew launcher ( #5116 )
2 years ago
nihui
6c6c40edb3
fix deconvolution x86 unaligned bias load ( #5112 )
2 years ago
teng
16524ca937
pnnx fix indent errors ( #5113 )
2 years ago
nihui
37a108c4c4
pnnx run test_inference by default ( #5110 )
2 years ago
nihuini
c2ccbf851d
pnnx fix filename base
2 years ago
nihui
3116e028d2
pnnx drop sdap scale=None for compatiblity with old torch ( #5107 )
2 years ago
nihui
14e14a9ae8
slice with indices ( #5103 )
2 years ago
nihui
3eb2969db9
fix build with ohos toolchain ( #5105 )
2 years ago
nihui
9dda7e385a
fix gridsample x86 warnings ( #5096 )
2 years ago
nihui
7afdbfa680
simplify vulkan conv1d ( #5095 )
2 years ago
nihui
efcfe95a42
pnnx do not sanitize directory part ( #5099 )
2 years ago
nihui
0c2a4a29de
pnnx convert batchnorm function ( #5097 )
2 years ago
nihui
54ab8051e3
fix warnings ( #5094 )
2 years ago
邓实诚
a1e3ebf8e5
implement simplemath ( #4905 )
* complete abs, fmod and sin function in simplemath.h
* remove some unused variables in simplemath.cpp
* modify test-coverage.yml and add some functions to simplemath.cpp
* modify erf.cpp which included math.h
* include platform.h for NCNN_SIMPLEMATH definition
* move utility constants and functions in simplemath.h to simplemath.cpp
* guard simplemath functions with extern "C"
* add NCNN_EXPORT macro in simplemath.h
* include plateform.h and guard all declarations with NCNN_SIMPLEMATH
* clean unused code in test_unaryop.cpp
* guard #include <vector> with NCNN_SIMPLEMATH in benchncnn.cpp
* add 'static' to guard functions that not declarated in header file
* modify sin and cos with better implementation
---------
Co-authored-by: HonestDeng <HonestDeng@users.noreply.github.com>
2 years ago
Zhenyu ZHAO
a6decad15d
Python ncnn with vulkan ( #5020 )
* add option for enabling vulkan in setup.py
* load winograd type options for pybind.
* enable the usage of humbletim/install-vulkan-sdk in ci
* vulkan-loader with release build
* python setup.py install --vulkan=on
* moltenVK for x86_64 macos
* remove universal2 temporarily
2 years ago
nihui
80b3b9c6f0
arm optimization for convolution int8 winograd unified elempack ( #5087 )
* enable out elempack 8 for winograd and sgemm
2 years ago
Yoh
3f437d3f3d
Grid sample op ( #4373 )
* pnnx support grid_sample op
* complete the permute and gridsample operator fusion
* spilt calculation into two stages and support permute fusion
2 years ago
FhqTreap
dc25128195
Vulkan conv1d ( #5060 )
2 years ago
張小凡
84aaedbe96
Added 5 devices(AMD*2, Intel*2, NVIDIA*1) benchmark result. ( #5085 )
2 years ago