hugo-syn
|
f35eb4b3b8
|
chore: Fix multiple typos (#5301)
Signed-off-by: hugo-syn <hugo.vincent@synacktiv.com>
|
2 years ago |
nihui
|
5329d32e74
|
check vulkan fp16 uniform support and implement lfp conversion without fp16u (#5287)
|
2 years ago |
nihui
|
c222208cc9
|
feat mask for disable threading, make some extractor setter no-op, update doc (#5270)
|
2 years ago |
張小凡
|
2ecaf37a3e
|
Fix find GPU driver dll path in windows (#5141)
|
2 years ago |
nihui
|
b4f26237cb
|
in-house vulkan loader (#5130)
* vulkan-driver-loader.md
* static vulkan on apple
|
2 years ago |
nihui
|
4494aadd74
|
deconvolution dynamic weight (#5119)
|
2 years ago |
nihui
|
14e14a9ae8
|
slice with indices (#5103)
|
2 years ago |
Yoh
|
3f437d3f3d
|
Grid sample op (#4373)
* pnnx support grid_sample op
* complete the permute and gridsample operator fusion
* spilt calculation into two stages and support permute fusion
|
2 years ago |
Amir Ramezani
|
7e5fa3ade3
|
shrink operator (#5022)
|
2 years ago |
Beq Jal
|
bcfec1da33
|
Celu layer and export to ncnn (#5019)
|
2 years ago |
Beq Jal
|
c851231832
|
add diag layer and its converter (#4935)
|
2 years ago |
nihui
|
4abadd2ffb
|
binaryop implicit broadcast B with 1 dimension rank for outer axis (#4930)
|
2 years ago |
張小凡
|
1e0d70af8c
|
Add translated document: glsl-extension.zh.md (#4818)
|
2 years ago |
nihui
|
43aba6badb
|
Update glsl-extension.md
|
2 years ago |
nihui
|
172b748c74
|
add ncnn glsl extension doc (#4817)
|
2 years ago |
nihui
|
9022b7162a
|
implement all explicit binaryop broadcast types (#4809)
* simplify binaryop
* less gpu test
* update binaryop broadcast doc
* do not test atan2 zero
|
2 years ago |
nihui
|
c28c8c04a1
|
multiheadattention attn mask (#4668)
|
3 years ago |
nihui
|
b640574b88
|
rough vulkan gemm and multiheadattention (#4618)
|
3 years ago |
nihui
|
afc9310c62
|
update new operators for modelwriter (#4540)
|
3 years ago |
nihui
|
fc6ce4a641
|
copyto operator (#4522)
|
3 years ago |
nihui
|
242e775d21
|
pnnx convert torch log10, pow 2 as square (#4518)
|
3 years ago |
nihui
|
246e71c526
|
implement atan2 (#4516)
|
3 years ago |
Fangjun Kuang
|
92e75105c9
|
Support torch.cumsum (#4505)
|
3 years ago |
nihui
|
ab4cfbf5b0
|
enrich ncnn binary broadcast rules (#4513)
|
3 years ago |
nihui
|
fed99fd35b
|
gemm output transpose, prepack c (#4479)
* mha is now permute and reshape free
* gemm user defined tile mnk param
|
3 years ago |
WuJinxuan
|
10e9d91576
|
Add x86 MultiHeadAttention (#4443)
* fix doc, sync x86 gemm fix
Co-authored-by: EdVince <EdVince@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
|
3 years ago |
nihui
|
fd1ac3c7a0
|
x86 optimization for gemm unified elempack (#4387)
|
3 years ago |
nihui
|
eceac35a7f
|
implement MultiheadAttention kdim vdim (#4347)
|
3 years ago |
Lry89757
|
6a47f8d15c
|
gridsample op support (#4288)
Co-authored-by: LRY89757 <LRY89757@users.noreply.github.com>
Co-authored-by: nihuini <nihuini@tencent.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
|
3 years ago |
Fangjun Kuang
|
5281d51535
|
implement GLU and pnnx conversion (#4283)
|
3 years ago |
nihui
|
77eda4c19f
|
implement lstm proj_size (#4263)
|
3 years ago |
miemie2013
|
720f3c9aab
|
Add DeformableConv2D (#4070)
* Add DeformableConv2D
* add unittest and docs
* pnnx torchvision deformconv2d conversion
Co-authored-by: miemie2013 <miemie2013@users.noreply.github.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
|
3 years ago |
Zhouzhou
|
4158e63668
|
docs:add sse optimized zh (#4053)
Signed-off-by: Zhouzhou <1197236910@qq.com>
|
3 years ago |
Lry89757
|
ca9abd1c4a
|
Update the add-custom-layer.zh.md (#3741)
1. 🐛Fix Bug of float and int. 修复了std::max()中参数int和float参数不符合的Bug
2. 👀 The structure of ncnn changes. ncnn文件结构变动,所有testlayer.cpp改成在tests/文件夹中
|
4 years ago |
nihui
|
2880eff264
|
deconv1d deconv3d (#3584)
* fix sigmoid returns nan with very large input
|
4 years ago |
nihuini
|
bc188ece58
|
update modelwriter for new operators
|
4 years ago |
nihui
|
c5d7f963b9
|
layer tile (#3491)
|
4 years ago |
nihuini
|
014387dfae
|
update operators doc
|
4 years ago |
nihui
|
3a83704c38
|
binary4d, unary4d (#3443)
|
4 years ago |
FeiGeChuanShu
|
0dea7a04a8
|
fix typo in doc (#3436)
|
4 years ago |
nihuini
|
e8d1d40398
|
update operators doc
|
4 years ago |
nihui
|
f10cc6dd93
|
initial data structure changes for 3dcnn, conv3d, pooling3d (#3378)
Co-authored-by: ElvisYu <elvisyuovo@gmail.com>
Co-authored-by: 余浩文 <m18107220188@163.com>
Co-authored-by: Zr2223 <67497651+Zr2223@users.noreply.github.com>
|
4 years ago |
huoshuai-dot
|
d8665b3687
|
Update how-to-implement-custom-layer-step-by-step.md (#3348)
|
4 years ago |
空言
|
dbf2db26fe
|
Update operators.md (#3345)
change weight type
|
4 years ago |
nihui
|
52721dd1eb
|
Update operators.md
|
4 years ago |
nihui
|
0902594334
|
Update operators.md
|
4 years ago |
nihui
|
a74c64df78
|
Update operators.md
|
4 years ago |
Tijmen Verhulsdonck
|
eaa7e24db6
|
Added ability to switch AVX/AVX2 during runtime (#3076)
|
4 years ago |
nihui
|
4f135e07bf
|
implement convolution1d and pooling1d (#3035)
* implement convolution1d and pooling1d
* add conv1d pool1d test
* fuse convolution1d activation
* update operator doc
* fix vulkan adpative pooling
|
4 years ago |
nihui
|
01be0ee226
|
update operators.md
|
5 years ago |