Jeff Zhou
41a9df5cd2
Merge fbfa1190e1 into e207b3bd13
9 months ago
nihui
d395000edc
flexible coopmat mnk and unified elempack for vulkan convolution 1x1s1d1 ( #6154 )
* helper function for selecting the optimal coopmat mnk size
10 months ago
chri321
4c72f52954
docs: update Chinese glsl-extension documentation ( #6162 )
- synchronize the latest English content to the Chinese documentation
- correct spelling errors in the English version of glsl-extension
- Fix spelling 'enable_validation_layer' in src/gpu.cpp
10 months ago
nihui
171b9d1bba
use spdx license header, copyright Tencent ( #6152 )
10 months ago
nihui
075d07ede2
compute-only vulkan ( #6131 )
10 months ago
nihui
9f832c19c1
vulkan int8 packing quantize dequantize requantize ( #3731 )
* add int8 definitions
* packing vulkan int8/int32, quantize vulkan
* vulkan dequantize
* requantize vulkan
11 months ago
nihui
626d9d0910
vulkan packing code clean, drop image storage type, unified fp16p fp16s packing ( #6128 )
11 months ago
nihui
bd0b111775
vulkan tight fp16p pack1 ( #6127 )
11 months ago
nihui
24a3b99f1f
drop layer support_image_storage and option use_image_storage ( #6126 )
* fix pyncnn build
11 months ago
nihui
211e238639
drop layer forward vkimagemat ( #6124 )
vkimagemat was originally used as a mat storage in the hope of improving performance on old adreno gpus. in practice it is slower than the cpu in most cases, and it is no longer suitable for the latest adreno architecture or for large shapes
11 months ago
nihui
cc40332804
discover VK_KHR_vulkan_memory_model ( #6121 )
11 months ago
nihui
6e3052a88d
print gpu matrix property info ( #6114 )
11 months ago
nihui
8998a13d06
discover VK_EXT_shader_float8 ( #6120 )
11 months ago
nihui
ef5e79d80e
add missing macros for VK_NV_cooperative_vector and VK_NV_cooperative_matrix2 ( #6119 )
11 months ago
nihui
12f57fb3d1
discover VK_NV_cooperative_matrix2 ( #6118 )
11 months ago
nihui
510b461e9a
discover VK_NV_cooperative_vector ( #6117 )
11 months ago
nihui
9cdc02bb7a
unified vulkan khr/nv cooperative matrix shader ( #6116 )
11 months ago
nihui
23b64c9cf9
fix vulkan validation error, do not enforce local_size_x be multiple of subgroup size ( #6111 )
11 months ago
nihui
ebc041cc56
force subgroup 32 for cooperative matrix shader atm ( #6100 )
11 months ago
tpoisonooo
dddfc50282
raise max gpu count to 32 ( #6084 )
1 year ago
nihui
6510fe6125
discover VK_KHR_shader_float_controls2 ( #6068 )
1 year ago
nihui
ca045ac579
gpu info query cooperative matrix properties ( #6067 )
1 year ago
nihui
7f899f2e94
update glslang, discover VK_KHR_shader_integer_dot_product and VK_KHR_shader_bfloat16 ( #6066 )
1 year ago
nihui
572b6198ad
fix subgroup min max size without subgroup size control extension ( #6047 )
1 year ago
nihui
305837fd4a
fix vulkan driver blacklist ( #6034 )
1 year ago
nihui
80da741307
glsl define ncnn_glsl_version macro ( #6003 )
1 year ago
nihui
33d03e625c
drop ncnn glsl string literal macros ( #5975 )
glsl has no string type anyway
1 year ago
nihui
8dbcfee5ec
option owns vulkan device index ( #5973 )
1 year ago
nihui
84970eed4d
vulkan validation layer enables NCNN_LOGE in shader source ( #5963 )
* NCNN_LOGE in glsl
* Update glsl-extension.md
1 year ago
nihui
b284dbd0f4
discover VK_KHR_shader_non_semantic_info, checked convolution imagestore ( #5955 )
1 year ago
nihui
eed257df1f
ci update llvmpipe ( #5954 )
* check image fp16
1 year ago
nihui
ad30c7f6fb
clean vulkan shader common extension ( #5952 )
* clean vulkan shader common extension
* macro suffix makes glslang unhappy
1 year ago
nihui
bf13c30210
define device feature macros for glslang, discover VK_EXT_shader_atomic_float and VK_EXT_shader_atomic_float2 ( #5949 )
1 year ago
nihui
8211930a6f
discover VK_KHR_shader_subgroup_rotate ( #5948 )
1 year ago
nihui
1b6485fa17
discover VK_KHR_zero_initialize_workgroup_memory ( #5947 )
1 year ago
nihui
40f7b4e527
discover all subgroup features and VK_KHR_shader_subgroup_extended_types ( #5946 )
1 year ago
nihui
0b9925cfef
integrate VK_EXT_subgroup_size_control features and properties ( #5940 )
1 year ago
Upliner Mikhalych
cbd17cd062
Fix #5741 don't crash when vkCreateDevice fails ( #5742 )
1 year ago
nihui
bd1f39ed82
blacklist mesa vulkan cooperative matrix feature ( #5739 )
ref https://gitlab.freedesktop.org/mesa/mesa/-/issues/10847
1 year ago
Asd-g
bbf2e5d533
create_gpu_instance: do not perform destroy_gpu_instance() ( #5437 )
When performing destroy_gpu_instance(), g_instance.created is always 0.
1 year ago
zhouwg
fbfa1190e1
apply code-format changes
2 years ago
zhou.weiguo
89aa45c723
fix build issue: no member named create_gpu_instance in namespace ncnn
2 years ago
張小凡
3b048d1923
destroy_gpu_instance() function wait for all devices to be idle before destroy ( #4763 )
* destroy_gpu_instance() will internally ensure that all vulkan devices are idle before proceeding with destruction.
2 years ago
Shatyuka
e7748e5311
Fix `destroy_gpu_instance` crash ( #5353 )
* Fix `destroy_gpu_instance` crash
* Additional check and clear
2 years ago
nihui
05b4dcb06c
report vulkan cm 8x8x16 config, enable fp16a cm ( #5298 )
2 years ago
nihui
5329d32e74
check vulkan fp16 uniform support and implement lfp conversion without fp16u ( #5287 )
2 years ago
nihui
556b79ce4d
create layer decoupled ( #5258 )
* create layer decoupled
* no more virtual public
* allow build test with shared library
* decouple cpu vulkan
* drop old scripts
2 years ago
nihui
ded0b78bb2
fix nvidia vulkan crash on exit ( #5234 )
2 years ago
nihui
8c4fc5e2a0
enable uniform 16bit and 8bit when available, fix validation error in fp16sa shader ( #5233 )
2 years ago
nihui
b4f26237cb
in-house vulkan loader ( #5130 )
* vulkan-driver-loader.md
* static vulkan on apple
2 years ago