nihui/ncnn - ncnn - 开源协同云脑生态支撑系统

Commit Graph

Author	SHA1	Message	Date
nihui	bb5bfe3841	avx2 infrastructure (#1943 )	5 years ago
nihui	11cffce114	armv8.2 infrastructure (#1856 ) * runtime cpu dispatch * force thread one * disable openmp for coverage * simplify test layer * print NCNN_TARGET_ARCH * less ci build variants * weight fp16 storage option * test convdw int8 * apple a12 a13 * ncnn_add_layer ncnn_add_shader cmake macro	5 years ago
nihui	fe6bc1ed4d	Ci rv64gcv and rv64gc (#1936 )	5 years ago
nihuini	f3b182da1f	fix ci build	6 years ago
nihuini	989b0f70cc	convert shader source to hex data at build time	6 years ago
nihuini	b5f85eee13	fix image1d_xx8 macro, normalize image shader	6 years ago
nihuini	6682cd1638	image fp16pa, mark some bugihfa todo	6 years ago
nihui	e8688b042f	fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test	6 years ago
nihui	62da1228e1	adreno image shader + fp16 + fp16a (#1714 ) * wip * wip * fix * image and imageview can not be destroyed until command execution ends * fast copy path for tightly packed data * wip * texture load works * 1d 3d image * record clone image, multiple commands share one image reference * upload download image * layer forward accept vkimagemat * vkimagemat graph works * staging vkimagemat for passing dynamic parameters, macro for fp32+image shader, padding image shader * vkimagemat elemsize * convolution test pass * conv1x1s1 image shader * fast staging image allocator from host memory, pooling image shader * convolutiondepthwise image shader * innerproduct image shader * packing image shader * crop deconvolution image shader * resolve spirv binding types * image fp16 and fp16a, cast image shader * eltwise image shader * wip * absval image shader * deconvolutiondepthwise image shader * concat image shader, squeezenet works * noop split image shader * uniform precision hint * layer support_image_storage * wip * vulkan device utility operator * command is storage and packing option aware * fallback to cpu on image allocation failed, mobilenetssd works * flatten image shader, enable more test * ci test * check imgfp32 imgfp16 imgfp16a features * fix ci test * fix ci test * upgrade swiftshader * wip * opt aggressive * imgfp16p * opt none * convolution winograd image shader * fix flush range, fast copy path for continous buffer * minor fix * fix innerproduct * wip ... * wip * cast fix * packing test * wip * image fp16p is fp16p * wip * silence * more line info * code clean * softmax image shader	6 years ago
nihuini	1ea9de3bdf	create shader pipeline by type index, resolve binding count and push constant count from spirv. since we don't create compound shader module for macos and ios compatibility, it is enough to use fixed main as the shader entry point	6 years ago
nihui	999da7158f	old glslang reject -Os option, as optimizing for size does not make a big difference, drop it for now, fix #1544	6 years ago
nihui	bbaa4dcce2	compile fp16pa, optimize shader for size, enable implicit fp16 arithmetic for qcom855 and qcom855plus	6 years ago
nihui	0f7e7bca02	shader shape specialization constant and basic local group size partition (#1523 ) * use Mat class for Shape description * shape specialization constant in compute shader * wip * wip * test forward_inplace, add binaryop unaryop sigmoid test * fix arm unaryop test * fix arm binaryop test * make shape hint optional, cast int8 to fp32, add cast test * wip * follow the good and old local size setting for conv1x1 * the optimal local size rewrite * fix build on msvc * add permute shader for all packing layout, add permute test * concat and slice patial shape constant, slice test * fix slice test * interp test * add lrn test, test packing layout implicitly * add eltwise test * add normalize test * add instancenorm test * reorg shape constant * simple local group size partition * add shape constant param	6 years ago
nihui	33b16811ce	reimplement sfp afp conversion macro as function style buffer load store, drop lds shader for the moment	6 years ago
nihui	5042d14d7d	define sfpvec8 afpvec8 macro, use modern glsl extension for fp16 arithmetic, fix padding aarch64 build	6 years ago
nihuini	628989770b	return values correctly	6 years ago
nihuini	eb9326002f	cmake ncnn_generate_shader_spv_header function	6 years ago
Natsu	637d96c1d2	Fix gcc 9 compilation failure (#1189 ) * Fix gcc 9 compilation failure * Fix compilation failure on linux gcc * Fix compilation failure on old gcc * Remove C++11 requirement	6 years ago
Natsu	6d1944f2c3	CMake improvement (#1115 ) * CMake improvement * Fix bugs * Fix typo * Propagate vulkan dependency * import vulkan * add config files, now exported target cmake should be able to find packages * Propagate no-rtti and no-exception * Provide a option to control rtti and exception in mobile platform * Make cmake clean * Resolve conflicts * Update CMake PIE is propagated by INTERFACE_POSITION_INDEPENDENT_CODE * Remove bad things	6 years ago

19 Commits (bedf00a5edf85ae8af33bc72ebd85fe325da1271)