nihui
d68ea6b4d5
update pnnx ci torch 2.8.0 ( #6238 )
9 months ago
nihui
c276398f29
pnnx ci pin safetensors ( #6236 )
9 months ago
nihui
9b91fe5153
implement flip layer and pnnx torch.flip conversion ( #6233 )
Co-authored-by: 佰阅 <43716063+Baiyuetribe@users.noreply.github.com>
9 months ago
nihui
1ad2bc676b
pnnx ncnn handles squeeze/unsqueeze the batch index ( #6232 )
9 months ago
nihui
e207b3bd13
fix vulkan command upload pack16 mat ( #6231 )
9 months ago
nihui
a17e31b2f4
pnnx handles conv convtranspose linear weight norm ( #6224 )
9 months ago
bil0077
b039b87dc0
migrate msvc ci to windows-2022 ( #6210 )
10 months ago
nihui
2f34e7f806
fix pnnx build on msvc, pnnx ci update torch and onnxruntime ( #6208 )
* pin opencv version
10 months ago
nihui
b10c9dd9a5
fix vulkan structure definition conflict with VK_KHR_acceleration_structure ( #6205 )
10 months ago
nihui
8c5e23b625
use size_t type for proper large tensor support ( #6201 )
10 months ago
Copilot
4644540ea4
Add Windows XP support merging PRs #6176 and #6177 ( #6204 )
Co-authored-by: Sugar-Baby <87747602+Sugar-Baby@users.noreply.github.com>
Co-authored-by: AtomAlpaca <66774326+AtomAlpaca@users.noreply.github.com>
10 months ago
nihui
fe509e9bc1
flexible coopmat mnk and unified elempack for vulkan deconvolution gemm ( #6199 )
10 months ago
nihui
091f7da5ba
update spacemit toolchain and qemu test ( #6200 )
10 months ago
nihui
6c44404d34
use pre-downloaded codecov cli binary ( #6198 )
10 months ago
nihui
8e69ddf707
flexible coopmat mnk and unified elempack for vulkan convolution winograd gemm ( #6196 )
10 months ago
nihui
0c3dad7656
flexible coopmat mnk and unified elempack for vulkan convolution gemm ( #6192 )
10 months ago
nihui
d395000edc
flexible coopmat mnk and unified elempack for vulkan convolution 1x1s1d1 ( #6154 )
* helper function for selecting the optimal coopmat mnk size
10 months ago
nihui
a1f5d5be47
fix unaryop on moltenvk ( #6181 )
10 months ago
韦康琦
4f9f7d929e
vulkan unaryop unified elempack shader ( #6179 )
10 months ago
Yexuan Wu
e2d93a482e
Unified elempack activation function vulkan shader ( #6175 )
10 months ago
WANG KE
4d8220ce22
Skip int8 model on GPU and merge from upstream ( #6174 )
10 months ago
GIBEREZ
44982d0d23
About the update to the GLSL documentation after the image functions are deprecated ( #6173 )
10 months ago
Yexuan Wu
11b6990a9e
vulkan sigmoid unified elempack shader ( #6170 )
10 months ago
Willaaaaaaa
04341120d4
feat: add cmake print ver support ( #6165 )
10 months ago
Yexuan Wu
7abf84eb74
Fix win32 GetLogicalProcessorInformationEx API ( #6169 )
* fix crash on nt kernel without GetLogicalProcessorInformationEx
10 months ago
nihui
0cfe201b3c
fix vulkan absval fp16 ( #6167 )
* fix 1d 2d cstep
* fix ranged cstep
10 months ago
Christopher
260f493ada
add cross toolchain cmake config of AK3918(AK) and SS928(hisi) ( #6164 )
10 months ago
AtomAlpaca
f2970319d3
loongarch LASX optimization for math/trigonometric functions and sigmoid ( #6163 )
* Add lasx optimization for loongarch_usability.h
* Add _LOONGARCH_FLOAT_CONST_PS256 to avoid variable redefinition
* Add lasx optimization for math funtion
* Add lasx optimization for sigmoid
* add lsx optimization for trigonometric functions
* add lasx optimization for trigonometric functions
10 months ago
chri321
4c72f52954
docs: update Chinese glsl-extension documentation ( #6162 )
- synchronize the latest English content to the Chinese documentation
- correct spelling errors in the English version of glsl-extention
- Fix spelling 'enable_validation_layer' in src/gpu.cpp
10 months ago
dependabot[bot]
cae9e636d0
Bump stefanzweifel/git-auto-commit-action from 5 to 6 ( #6115 )
Bumps [stefanzweifel/git-auto-commit-action](https://github.com/stefanzweifel/git-auto-commit-action ) from 5 to 6.
- [Release notes](https://github.com/stefanzweifel/git-auto-commit-action/releases )
- [Changelog](https://github.com/stefanzweifel/git-auto-commit-action/blob/master/CHANGELOG.md )
- [Commits](https://github.com/stefanzweifel/git-auto-commit-action/compare/v5...v6 )
---
updated-dependencies:
- dependency-name: stefanzweifel/git-auto-commit-action
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
10 months ago
nihui
171b9d1bba
use spdx license header, copyright Tencent ( #6152 )
10 months ago
nihui
075d07ede2
compute-only vulkan ( #6131 )
11 months ago
zhuzeitou
10912b3b58
Use platform-specific APIs for environment variables ( #6147 )
* Use platform-specific APIs for environment variables
The previous patch used `putenv` as a quick fix for Windows compatibility. However, `putenv` is a legacy API and not the recommended choice.
This commit replaces the single `putenv` call with the most appropriate function for each platform:
- On Windows, it now uses the modern and secure `_putenv_s`.
- On Unix-like systems, it uses the standard `setenv`.
---------
Co-authored-by: nihui <shuizhuyuanluo@126.com>
11 months ago
nihui
9f832c19c1
vulkan int8 packing quantize dequantize requantize ( #3731 )
* add int8 definitions
* packing vulkan int8/int32, quantize vulkan
* vulkan dequantize
* requantize vulkan
11 months ago
J. Zow
ed6dcd0c81
Fix ci error for update linux-riscv64.yml ( #6133 )
11 months ago
nihui
7557f5c208
vulkan absval unified elempack shader ( #6132 )
11 months ago
nihui
626d9d0910
vulkan packing code clean, drop image storage type, unified fp16p fp16s packing ( #6128 )
11 months ago
nihui
bd0b111775
vulkan tight fp16p pack1 ( #6127 )
11 months ago
nihui
24a3b99f1f
drop layer support_image_storage and option use_image_storage ( #6126 )
* fix pyncnn build
11 months ago
nihui
f168962a74
update glslang for fp8, fix fp8 enum ( #6125 )
11 months ago
nihui
abf0de4488
update ruapu to detect zfh zvfh xtheadvector ( #5841 )
* always prefer xtheadvector
* update ci toolchain
11 months ago
nihui
211e238639
drop layer forward vkimagemat ( #6124 )
vkimagemat was originally used as a mat storage in the hope of improving performance on old adreno gpus, but in fact it is slower than the cpu in most cases and is no longer suitable for the latest adreno architecture and large shapes
11 months ago
Yexuan Wu
1cd5373483
instancenorm x86 simd optimization ( #6097 )
11 months ago
nihui
cc40332804
discover VK_KHR_vulkan_memory_model ( #6121 )
11 months ago
nihui
6e3052a88d
print gpu matrix property info ( #6114 )
11 months ago
nihui
8998a13d06
discover VK_EXT_shader_float8 ( #6120 )
11 months ago
nihui
ef5e79d80e
add missing macros for VK_NV_cooperative_vector and VK_NV_cooperative_matrix2 ( #6119 )
11 months ago
nihui
12f57fb3d1
discover VK_NV_cooperative_matrix2 ( #6118 )
11 months ago
nihui
510b461e9a
discover VK_NV_cooperative_vector ( #6117 )
11 months ago
nihui
9cdc02bb7a
unified vulkan khr/nv cooperative matrix shader ( #6116 )
11 months ago