nihui/ncnn - ncnn - Open-Source Collaborative Cloud Brain Ecosystem Support System

nihui c4a007406d windows clang ci (#5469): windows clang ci; clang msvc use x86intrin.h for xop; test arm64 compiler features	2 years ago
arm	windows clang ci (#5469)	2 years ago

loongarch	update ncnnoptimize layers, lightmode=false keeps original weight (#5414)	2 years ago

mips	update ncnnoptimize layers, lightmode=false keeps original weight (#5414)	2 years ago

riscv	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

vulkan	update ncnnoptimize layers, lightmode=false keeps original weight (#5414)	2 years ago

x86	windows clang ci (#5469)	2 years ago

absval.cpp	layer creator function is not necessary for built-in layers	6 years ago

absval.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

argmax.cpp	Support remove libstdc++ denpendency (#2030)	5 years ago

argmax.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

batchnorm.cpp	implement simplemath (#4905)	2 years ago

batchnorm.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

bias.cpp	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

bias.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

binaryop.cpp	implement simplemath (#4905)	2 years ago

binaryop.h	implement atan2 (#4516)	3 years ago

bnll.cpp	implement simplemath (#4905)	2 years ago

bnll.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

cast.cpp	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

cast.h	cast between float32 and bfloat16	6 years ago

celu.cpp	implement simplemath (#4905)	2 years ago

celu.h	Celu layer and export to ncnn (#5019)	2 years ago

clip.cpp	clip gelu mish tanh 4d (#4695)	3 years ago

clip.h	architecture changes for int8 packing (#2771)	5 years ago

concat.cpp	concat 4d (#4528)	3 years ago

concat.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

convolution.cpp	create layer decoupled (#5258)	2 years ago

convolution.h	create layer decoupled (#5258)	2 years ago

convolution1d.cpp	create layer decoupled (#5258)	2 years ago

convolution1d.h	create layer decoupled (#5258)	2 years ago

convolution3d.cpp	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

convolution3d.h	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

convolutiondepthwise.cpp	create layer decoupled (#5258)	2 years ago

convolutiondepthwise.h	create layer decoupled (#5258)	2 years ago

convolutiondepthwise1d.cpp	pnnx enhance functionize (#5387)	2 years ago

convolutiondepthwise1d.h	create layer decoupled (#5258)	2 years ago

convolutiondepthwise3d.cpp	fix too many microtask error in old libomp runtime (#4002)	3 years ago

convolutiondepthwise3d.h	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

copyto.cpp	copyto operator (#4522)	3 years ago

copyto.h	copyto operator (#4522)	3 years ago

crop.cpp	crop4d (#3402)	4 years ago

crop.h	crop4d (#3402)	4 years ago

cumulativesum.cpp	Support torch.cumsum (#4505)	3 years ago

cumulativesum.h	Support torch.cumsum (#4505)	3 years ago

deconvolution.cpp	deconvolution dynamic weight (#5119)	2 years ago

deconvolution.h	deconvolution dynamic weight (#5119)	2 years ago

deconvolution1d.cpp	deconvolution dynamic weight (#5119)	2 years ago

deconvolution1d.h	deconvolution dynamic weight (#5119)	2 years ago

deconvolution3d.cpp	fix too many microtask error in old libomp runtime (#4002)	3 years ago

deconvolution3d.h	deconv1d deconv3d (#3584)	4 years ago

deconvolutiondepthwise.cpp	deconvolution dynamic weight (#5119)	2 years ago

deconvolutiondepthwise.h	deconvolution dynamic weight (#5119)	2 years ago

deconvolutiondepthwise1d.cpp	deconvolution dynamic weight (#5119)	2 years ago

deconvolutiondepthwise1d.h	deconvolution dynamic weight (#5119)	2 years ago

deconvolutiondepthwise3d.cpp	fix too many microtask error in old libomp runtime (#4002)	3 years ago

deconvolutiondepthwise3d.h	deconv1d deconv3d (#3584)	4 years ago

deepcopy.cpp	layer creator function is not necessary for built-in layers	6 years ago

deepcopy.h	deepcopy layer	6 years ago

deformableconv2d.cpp	reduce double usage (#4671)	3 years ago

deformableconv2d.h	Add DeformableConv2D (#4070)	3 years ago

dequantize.cpp	architecture changes for int8 packing (#2771)	5 years ago

dequantize.h	architecture changes for int8 packing (#2771)	5 years ago

detectionoutput.cpp	implement simplemath (#4905)	2 years ago

detectionoutput.h	mxnet-ssd wip ...	7 years ago

diag.cpp	add diag layer and its converter (#4935)	2 years ago

diag.h	add diag layer and its converter (#4935)	2 years ago

dropout.cpp	implement simplemath (#4905)	2 years ago

dropout.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

einsum.cpp	implement einsum layer and pnnx conversion (#3768)	4 years ago

einsum.h	implement einsum layer and pnnx conversion (#3768)	4 years ago

eltwise.cpp	eltwise 4d (#4529)	3 years ago

eltwise.h	format code style and setup restyled.io (#1840)	6 years ago

elu.cpp	implement simplemath (#4905)	2 years ago

elu.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

embed.cpp	layer creator function is not necessary for built-in layers	6 years ago

embed.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

erf.cpp	disable rtti and exceptions for msvc (#5167)	2 years ago

erf.h	erf implementation (#5012)	2 years ago

exp.cpp	implement simplemath (#4905)	2 years ago

exp.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

expanddims.cpp	squeeze and expanddims 4d (#4346)	3 years ago

expanddims.h	squeeze and expanddims 4d (#4346)	3 years ago

flatten.cpp	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

flatten.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

fold.cpp	clear warning: unused variable while building on x86-wsl platform (#5444)	2 years ago

fold.h	implement ncnn fold and unfold (#4326)	3 years ago

fused_activation.h	create layer decoupled (#5258)	2 years ago

gelu.cpp	implement simplemath (#4905)	2 years ago

gelu.h	update modelwriter for new operators	4 years ago

gemm.cpp	shadowed variable for less openmp task args (#4744)	3 years ago

gemm.h	gemm output transpose, prepack c (#4479)	3 years ago

glu.cpp	implement simplemath (#4905)	2 years ago

glu.h	implement GLU and pnnx conversion (#4283)	3 years ago

gridsample.cpp	implement simplemath (#4905)	2 years ago

gridsample.h	Grid sample op (#4373)	2 years ago

groupnorm.cpp	implement simplemath (#4905)	2 years ago

groupnorm.h	fuse onnx groupnorm	5 years ago

gru.cpp	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

gru.h	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

hardsigmoid.cpp	add comment for alpha beta in hardswish and hardsigmoid compared to tf/pytorch implementation (#2859)	5 years ago

hardsigmoid.h	Implemented hard sigmoid (#1046)	7 years ago

hardswish.cpp	add comment for alpha beta in hardswish and hardsigmoid compared to tf/pytorch implementation (#2859)	5 years ago

hardswish.h	Implemented hard swish layer	6 years ago

innerproduct.cpp	create layer decoupled (#5258)	2 years ago

innerproduct.h	create layer decoupled (#5258)	2 years ago

input.cpp	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

input.h	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

instancenorm.cpp	implement simplemath (#4905)	2 years ago

instancenorm.h	instancenorm without affine	5 years ago

interp.cpp	fix interp nearest by scale factor, fix issue #3555 (#3565)	4 years ago

interp.h	add interp param align_corner (#2236)	5 years ago

layernorm.cpp	implement simplemath (#4905)	2 years ago

layernorm.h	fuse onnx layernorm, fix 2-dim layernorm implementation, add test	5 years ago

log.cpp	implement simplemath (#4905)	2 years ago

log.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

lrn.cpp	implement simplemath (#4905)	2 years ago

lrn.h	format code style and setup restyled.io (#1840)	6 years ago

lstm.cpp	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

lstm.h	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

matmul.cpp	convert torch.matmul, eliminate noop pad and identity op, fuse transpose matmul, fuse select to unbind (#3554)	4 years ago

matmul.h	convert torch.matmul, eliminate noop pad and identity op, fuse transpose matmul, fuse select to unbind (#3554)	4 years ago

memorydata.cpp	support tag in memorydata layer (#5061)	2 years ago

memorydata.h	support tag in memorydata layer (#5061)	2 years ago

mish.cpp	implement simplemath (#4905)	2 years ago

mish.h	Add mish layer (#1733)	6 years ago

multiheadattention.cpp	multiheadattention attn mask (#4668)	3 years ago

multiheadattention.h	multiheadattention attn mask (#4668)	3 years ago

mvn.cpp	implement simplemath (#4905)	2 years ago

mvn.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

noop.cpp	create layer decoupled (#5258)	2 years ago

noop.h	create layer decoupled (#5258)	2 years ago

normalize.cpp	implement simplemath (#4905)	2 years ago

normalize.h	normalize support inplace	6 years ago

packing.cpp	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

packing.h	fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test	6 years ago

padding.cpp	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

padding.h	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

permute.cpp	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

permute.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

pixelshuffle.cpp	pixelshuffle nhwc mode, convert onnx DepthToSpace mode DCR, convert mlir tf.DepthToSpace	5 years ago

pixelshuffle.h	pixelshuffle nhwc mode, convert onnx DepthToSpace mode DCR, convert mlir tf.DepthToSpace	5 years ago

pooling.cpp	fuse adaptive pool dynamic output size, implement ncnn adaptive pooling dynamic outsize (#5043)	2 years ago

pooling.h	support PyTorch AdaptiveAvgPool2d and AdaptiveMaxPool2d (#2546)	5 years ago

pooling1d.cpp	use int ceil/floor div (#3333)	4 years ago

pooling1d.h	implement convolution1d and pooling1d (#3035)	5 years ago

pooling3d.cpp	fuse adaptive pool dynamic output size, implement ncnn adaptive pooling dynamic outsize (#5043)	2 years ago

pooling3d.h	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

power.cpp	implement simplemath (#4905)	2 years ago

power.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

prelu.cpp	layer creator function is not necessary for built-in layers	6 years ago

prelu.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

priorbox.cpp	implement simplemath (#4905)	2 years ago

priorbox.h	add mobilenetv3-ssd (#1335)	6 years ago

proposal.cpp	implement simplemath (#4905)	2 years ago

proposal.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

psroipooling.cpp	implement simplemath (#4905)	2 years ago

psroipooling.h	implement psroipooling layer	7 years ago

quantize.cpp	implement simplemath (#4905)	2 years ago

quantize.h	architecture changes for int8 packing (#2771)	5 years ago

reduction.cpp	workaround l2 norm produce -inf value with subnormals (#5272)	2 years ago

reduction.h	format code style and setup restyled.io (#1840)	6 years ago

relu.cpp	initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)	4 years ago

relu.h	architecture changes for int8 packing (#2771)	5 years ago

reorg.cpp	pixelshuffle nhwc mode, convert onnx DepthToSpace mode DCR, convert mlir tf.DepthToSpace	5 years ago

reorg.h	reorg nhwc mode, code format	5 years ago

requantize.cpp	requantize: Use activation_ss in fused_activation.h (#5245)	2 years ago

requantize.h	architecture changes for int8 packing (#2771)	5 years ago

reshape.cpp	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

reshape.h	relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397)	4 years ago

rnn.cpp	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

rnn.h	rnn/lstm/gru dynamic quantization (#5435)	2 years ago

roialign.cpp	implement simplemath (#4905)	2 years ago

roialign.h	Improve ROIAlign (accelerate ROIAlign, support sampling ratio and aligned ROIAlign) (#1820)	6 years ago

roipooling.cpp	implement simplemath (#4905)	2 years ago

roipooling.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

scale.cpp	layer creator function is not necessary for built-in layers	6 years ago

scale.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

selu.cpp	implement simplemath (#4905)	2 years ago

selu.h	implemented selu activation (#1051)	7 years ago

shrink.cpp	shrink operator (#5022)	2 years ago

shrink.h	shrink operator (#5022)	2 years ago

shufflechannel.cpp	fix shufflechannel.cpp integer multiply may overflow (#2212)	5 years ago

shufflechannel.h	shufflechannel reverse group style	6 years ago

sigmoid.cpp	implement simplemath (#4905)	2 years ago

sigmoid.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

slice.cpp	slice with indices (#5103)	2 years ago

slice.h	slice with indices (#5103)	2 years ago

softmax.cpp	implement simplemath (#4905)	2 years ago

softmax.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

softplus.cpp	implement simplemath (#4905)	2 years ago

softplus.h	softplus inplace (#2156)	5 years ago

split.cpp	create layer decoupled (#5258)	2 years ago

split.h	create layer decoupled (#5258)	2 years ago

spp.cpp	implement simplemath (#4905)	2 years ago

spp.h	format code style and setup restyled.io (#1840)	6 years ago

squeeze.cpp	squeeze and expanddims 4d (#4346)	3 years ago

squeeze.h	squeeze and expanddims 4d (#4346)	3 years ago

statisticspooling.cpp	implement simplemath (#4905)	2 years ago

statisticspooling.h	add statisticspooling layer (#1768)	6 years ago

swish.cpp	implement simplemath (#4905)	2 years ago

swish.h	Add Swish layer (#1799)	6 years ago

tanh.cpp	implement simplemath (#4905)	2 years ago

tanh.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

threshold.cpp	layer creator function is not necessary for built-in layers	6 years ago

threshold.h	implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469	8 years ago

tile.cpp	layer tile (#3491)	4 years ago

tile.h	layer tile (#3491)	4 years ago

unaryop.cpp	implement simplemath (#4905)	2 years ago

unaryop.h	pnnx convert torch round trunc (#4813)	3 years ago

unfold.cpp	implement ncnn fold and unfold (#4326)	3 years ago

unfold.h	implement ncnn fold and unfold (#4326)	3 years ago

yolodetectionoutput.cpp	create layer decoupled (#5258)	2 years ago

yolodetectionoutput.h	move vulkan code to subdir, new layer interface create_pipeline and destroy_pipeline for post-loading works	7 years ago

yolov3detectionoutput.cpp	create layer decoupled (#5258)	2 years ago

yolov3detectionoutput.h	Add yolov3detectionoutput test and AVX optimization (#1994)	5 years ago

C++ C Python Text Protocol Buffer other

nihuini@tencent.com shuizhuyuanluo@126.com imzhuo@foxmail.com 49699333+dependabot[bot]@users.noreply.github.com 248857878@qq.com khj.application@aliyun.com e.proydakov@gmail.com zhiliu6@gmail.com 67086033+ncnnnnn@users.noreply.github.com caishanli25@gmail.com Timen@users.noreply.github.com 314377460@qq.com Mouri_Naruto@Outlook.com 19603468+cmdbug@users.noreply.github.com me@lstlx.com 2672931+whyb@users.noreply.github.com 244188+JeremyRand@users.noreply.github.com leo@nullptr.com.cn mingzilaochongtu@gmail.com myhouseng@gmail.com 77330637+LRY89757@users.noreply.github.com daquexian566@gmail.com nihui@users.noreply.github.com 774074168@qq.com gemfield@civilnet.cn

Contributors (25+) All
