nihui/ncnn - ncnn - 开源协同云脑生态支撑系统

Commit Graph

Author	SHA1	Message	Date
nihuini	838c5df839	option api changes	7 years ago
nihuini	4de4078779	move platform includes out of namespace	7 years ago
BUG1989	b53541e8f9	fix arm winograd int8,optimize winograd x86 (#1025 )	7 years ago
BUG1989	01b3804828	optimization the x86 convolution layer with avx2 (#1019 ) * add the "Tu Fa" conv sgemm fp32 with avx2 for x86 * add avx2 cmake option * fix some bugs of avx2 pull request	7 years ago
nihui	3e003ffd98	fuse sigmoid	7 years ago
nihuini	7a8f68aca6	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago
nihuini	3f85cafc08	fuse relu leakyrelu clip into convolution/deconvolution/innerproduct	7 years ago
BUG1989	93a34a897d	add int8 winograd F(4,3) with neon assembly optimization (#891 ) * add the implement of int8 winograd F(4,3) * add int8 winograd F(4,3) naive c to arm64-v8a platform * optimize int8 winograd F(4,3) with neon * merge dequant op into int8 winograd F(4,3) * enable int8 wino F(4,3) case with all size	7 years ago
BUG1989	780c7d9a72	merge de/requantize op, optimize some int8 conv layer on arm64-v8a (#867 ) * optimize the conv sgemm int8 on arm64-v8a platform * optimize int8 arm64-v8a with sadalp ins * merge requantize op into latest conv layer * merge requantize op into conv-int8 op * update the mobilenet.param in the benchmark * Update README.md update Kirin970 and RK3399 * try to fix the travis build error	7 years ago
BUG1989	8e337d440e	fix the bug with convdw7x7 op working on int8 mode (#818 )	7 years ago
BUG1989	df3d224484	new int8 implement,better accuracy (#749 ) * add the armv7a conv3x3s1 implement without overflow,remove old codes * fix the bug of conv3x3s2 packed int8 * new int8 implement,weight quant by perchanel,better accuracy~ * fix the bug of conv3x3s1 packed int8 neon * add the naive c fp32 and int8 winograd F(2,3) * add the neon intrinsic int8 winograd F(2,3) * optimize the armv7a int8 winograd F(2,3) with neon assembly * optimize the armv7a int8 winograd F(2,3) input transform with assembly. * add the requantize layer and int8 relu implement. * add graph optimize conv1x1s2 -> conv1x1s1,begin optimize int8 aarch64. * fix int8 bugs * add the c naive im2col with sgemm * add aarch64 int8 winograd f23, conv3x3s2 naive implement * add the int8 sgemm conv7x7s2 on x86/armv7a platform * optimize the int8 sgemm by neon intrinsic and packed kernel * optimize the int8 sgemm with packed data * optimize the int8 sgemm with armv7a neon assembly * add the int8 sgemm on arm64-v8a platform * perpare to merge latest codes from master * add the int8 param files * In the Class Net,add the fuse_network method	7 years ago
nihuini	6f1b0b0a61	quantized padding in convolution, use range sweets	7 years ago
nihui	72411b7a6c	restore the old conv3x3s2 as reference, fast dilation convolution fails on striding	7 years ago
nihui	fe14037777	more sub op preload	7 years ago
nihui	5d04a3a45c	layer holds bottom blob scale, depthwise convolution read group scales	7 years ago
nihuini	4be27a0a89	int8 inference on x86	7 years ago
nihui	a169cec363	core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table	7 years ago
nihui	9706cd1447	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	7 years ago
Hyungsuk Yoon	8f56e00b4b	make convolution with dilation fast	8 years ago
nihui	7d1e49584d	call Innerproduct for convolution on flattened blob	8 years ago
nihuini	03621aa7f9	more x86 stub for convolution and convolutiondepthwise	8 years ago
nihui	bdb70a2010	padding w h in convolution and deconvolution	8 years ago
nihui	44b4519307	non-square convolution and deconvolution kernel stride dilation	8 years ago
Hyungsuk Yoon	c641db8034	Fix bug for convolution on x86	8 years ago
nihuini	47218db6e5	fix minus padding SAME, fix #116	8 years ago
nihuini	23630b14b9	implement tensorflow style padding SAME type for convolution and pooling, second try	8 years ago
Zuyu Zhang	d5f0214ec7	Fixed the build warnings and added gitignore.	8 years ago
nihuini	b7db8be4f6	add ncnn source qwq	9 years ago

28 Commits (838c5df839004236bbd350ffe93fabfc888dd7fc)