243 Commits (9376ba71c1251f82994de14cee0a662677037c0b)

Author SHA1 Message Date
  nihui 241524ffce
discard weight memory for x86 arm vulkan (#3865) 4 years ago
  tpoisonooo 6fd801b6d7
feat(src/layer): add vision_transformer benchmark (#3730) 4 years ago
  NaLan ZeYu 5388f9f312
test: fix printf arguments mismatch (#3774) 4 years ago
  nihui f9c1787de9
implement einsum layer and pnnx conversion (#3768) 4 years ago
  nihui ee6402553c
layernorm for vector and mat along w, pnnx convnext end2end test (#3764) 4 years ago
  jasonZhang e62d674e5d
Add unittest and SSE&AVX optimized for BNLL (#3759) 4 years ago
  nihui 308965b7e9
sanitize cooperative matrix option in tests 4 years ago
  nihui 0ea327b557
x86 sse/avx/avx512 optimization for softmax (#3712) 4 years ago
  nihui 131f3d1323
x86 avx512 optimization for convolution winograd pack16to1 and deconvolution family, increase simpleomp argv count (#3694) 4 years ago
  nihui dadc640c66
x86 avx512 optimization (#3581) 4 years ago
  nihui 559e5b23f9
vulkan tensorcore optimization (#3628) 4 years ago
  nihui 002c07d4ec
mix vulkan winograd f23 and f43 (#3639) 4 years ago
  nihui d42e048b56
pnnx convert torch.addmm (#3634) 4 years ago
  nihui 6e19ab26ba
massive vulkan optimization (#3602) 4 years ago
  nihui 2880eff264
deconv1d deconv3d (#3584) 4 years ago
  nihui 920aa79f04
drop x86 avx2 fp16 (#3568) 4 years ago
  nihui d452eca28f
convert torch.matmul, eliminate noop pad and identity op, fuse transpose matmul, fuse select to unbind (#3554) 4 years ago
  Yuzhong Yan 681141ff42
[YZ] Fix bug in unit test (#3556) 4 years ago
  nihui 33e225f173
fix c api test 4 years ago
  nihui c5d7f963b9
layer tile (#3491) 4 years ago
  Xiaohan Liu 3daabd515d
add missing doffset (#3475) 4 years ago
  nihui 922f8b33c1
reduction4d, merge keepdims arg, add test (#3469) 4 years ago
  nihui 3a83704c38
binary4d, unary4d (#3443) 4 years ago
  nihui 6941ec8fc9
arm neon optimization for general packed convolution (#3426) 4 years ago
  nihui 999e640d43
dynamic convolution weight (#3408) 4 years ago
  nihui f98c396e6b
crop4d (#3402) 4 years ago
  nihui cf20dbc0bd
relu3d, batchnorm3d, reshape4d, flatten4d, permute4d (#3397) 4 years ago
  nihui f10cc6dd93
initial data structure changes for 3dcnn, conv3d, pooling3d (#3378) 4 years ago
  nihui 24fbb6e8cb
honor thread setting on load and vulkan command, ci avx512 t4 (#3391) 4 years ago
  nihui f433f86874
fix squeeze expanddims axes and add test (#3359) 4 years ago
  nihui 0b664ec438
fix potential out of range read in test with int8 inputs (#3357) 4 years ago
  nihui 525df8bcc5
rnn/lstm/gru with unequal input output (#3352) 4 years ago
  nihui f448a8f595
implement interp-1d on 2d blob (#3349) 4 years ago
  nihui 5eb4a2ccd0
implement convolutiondepthwise1d (#3342) 4 years ago
  nihui b3a521981b
implement interp cubic aligncorner (#3338) 4 years ago
  nihui aa9753b2f0
detach mat from local blob allocator so net instance could be destroyed much earlier (#3287) 4 years ago
  zhiliu6 814f89ef1a
Fuse HardSwish activation into Convolution and InnerProduct (#3233) 4 years ago
  Tijmen Verhulsdonck 4270b5c502
Fix broken codepaths with AVX only (#3254) 4 years ago
  zhiliu6 80699dd3f9
fix hardswish test beta param (#3214) 4 years ago
  nihui c6cda8d07c
arm neon optimization for requantize leakyrelu (#3144) 4 years ago
  Xavier Hsinyuan 2a5c672787
Add unittest and RVV optimized for SELU (#3114) 4 years ago
  nihuini f1533667ff
fix test_c_api net instance destroyed earlier than blob destruction 4 years ago
  Tijmen Verhulsdonck eaa7e24db6
Added ability to switch AVX/AVX2 during runtime (#3076) 4 years ago
  nihui b413fd3a3d
auto code-format bot and disable restyled (#3075) 4 years ago
  DaydreamCoding f42d0e5dc9
fix warpaffine_bilinear_yuv420sp uv matrix (#3048) 4 years ago
  nihui 4f135e07bf
implement convolution1d and pooling1d (#3035) 4 years ago
  nihuini 12eaa6f9ba
update concat test 4 years ago
  nihuini a180bf7bdc
update concat test for larger channels 4 years ago
  nihui c1ce8ea84d
add more test 5 years ago
  nihuini 07fa2e1fe3
prefer large channels for int8 operator tests 5 years ago