nihui/ncnn - ncnn - 开源协同云脑生态支撑系统

354 MB

Tree: 7a0c19c856

Author	SHA1	Message	Date
nihui	0f7e7bca02	shader shape specialization constant and basic local group size partition (#1523 ) * use Mat class for Shape description * shape specialization constant in compute shader * wip * wip * test forward_inplace, add binaryop unaryop sigmoid test * fix arm unaryop test * fix arm binaryop test * make shape hint optional, cast int8 to fp32, add cast test * wip * follow the good and old local size setting for conv1x1 * the optimal local size rewrite * fix build on msvc * add permute shader for all packing layout, add permute test * concat and slice patial shape constant, slice test * fix slice test * interp test * add lrn test, test packing layout implicitly * add eltwise test * add normalize test * add instancenorm test * reorg shape constant * simple local group size partition * add shape constant param	6 years ago
nihuini	9b33e647bd	use fixed blob names for benchmark	7 years ago
nihui	2fe769f314	update fused param files, enable ncnnoptimize tool build	7 years ago
BUG1989	df3d224484	new int8 implement,better accuracy (#749 ) * add the armv7a conv3x3s1 implement without overflow,remove old codes * fix the bug of conv3x3s2 packed int8 * new int8 implement,weight quant by perchanel,better accuracy~ * fix the bug of conv3x3s1 packed int8 neon * add the naive c fp32 and int8 winograd F(2,3) * add the neon intrinsic int8 winograd F(2,3) * optimize the armv7a int8 winograd F(2,3) with neon assembly * optimize the armv7a int8 winograd F(2,3) input transform with assembly. * add the requantize layer and int8 relu implement. * add graph optimize conv1x1s2 -> conv1x1s1,begin optimize int8 aarch64. * fix int8 bugs * add the c naive im2col with sgemm * add aarch64 int8 winograd f23, conv3x3s2 naive implement * add the int8 sgemm conv7x7s2 on x86/armv7a platform * optimize the int8 sgemm by neon intrinsic and packed kernel * optimize the int8 sgemm with packed data * optimize the int8 sgemm with armv7a neon assembly * add the int8 sgemm on arm64-v8a platform * perpare to merge latest codes from master * add the int8 param files * In the Class Net,add the fuse_network method	7 years ago

4 Commits (7a0c19c8563e0e65d458c548f983c0d2dcdb36ba)