2399 Commits (08ecc94d63b76df4afe2134f64876a2fadacecda)
 

Author SHA1 Message Date
  Kagurazaka Kotori 08ecc94d63
x86: Use _mm_cvtsi128_si{32,64} in float2int8 (#3536) 4 years ago
  nihui 1d0b78f9b6
Update README.md 4 years ago
  nihui a356d152bb
Update README.md 4 years ago
  Joson 70795c6548
Create README.md (#3532) 4 years ago
  teng 3ff9ae707f
simplify macro (#3530) 4 years ago
  Kagurazaka Kotori 5c078016c2
x86/avx_mathfun.h: Remove fallback warnings (#3527) 4 years ago
  nihui 2d46994d2e wrap avxvnni and avx512vnni build options over cpu feature detector 4 years ago
  nihui 33e225f173 fix c api test 4 years ago
  nihui bae2ee375f simplify c api layer forward_n output array type 4 years ago
  nihuini 1be043aad5
convert torch mean/sum/prod reduction with no args 4 years ago
  nihuini b4a755495c
convert pnnx zeros roll remainder 4 years ago
  nihui c0a94cd9ca
fix armv7 without neon (#3514) 4 years ago
  nihuini 4ba1eb6d2f
assign unique names for all pnnx operator and operand names. fix #3493 4 years ago
  nihuini 457f7d1c63
fix use-after-free, fix #3492 4 years ago
  nihui b07ad54320 add zynq-7020 benchmark 4 years ago
  nihui 4e4e0b9cf8 do not link libgcc as we no longer rely on builtin support cpu feature intrinsics now 4 years ago
  nihui 71f377e9e9 update benchmark from Q-engineering 4 years ago
  nihui d95213a005
x86 convolution int8 optimization third stage (#3506) 4 years ago
  nihuini 9f7f491885
use the old-style __cpuid_count for old compiler compatibility, fix #3510 4 years ago
  nihui 930c36ebe2
avx512 infrastructure (#3407) 4 years ago
  nihui c2896bcd4d
x86 convolution int8 optimization second stage (#3495) 4 years ago
  teng 13a51fbcf8
add else (#3494) 4 years ago
  nihui e9b8f0a6ef
x86 avx2 optimization for convolution gemm int8 (#3489) 4 years ago
  nihui c5d7f963b9
layer tile (#3491) 4 years ago
  dependabot[bot] d25388c938
Bump pypa/gh-action-pypi-publish from 1.4.2 to 1.5.0 (#3490) 4 years ago
  Xiaoyang Chen 4d31c46532
[pnnx] Update README.md (#3487) 4 years ago
  nihui 7d3503c06a
pnnx Tensor index (#3483) 4 years ago
  nihuini 1db16ce9fc
pnnx torch norm stack test 4 years ago
  nihuini 23d3340017
pnnx norm stack 4 years ago
  nihuini e33bdd16e8
pnnx fuse conv1d-bn convtranspose1d-bn 4 years ago
  nihuini f8ca1e7585
fix pnnx crash on unsupported expression 4 years ago
  nihui 7c60dc2db7
pnnx roialign (#3478) 4 years ago
  nihui 143258e317
pnnx torchvision deformconv2d (#3459) 4 years ago
  Xiaohan Liu 3daabd515d
add missing doffset (#3475) 4 years ago
  nihui 7b222a19af
update benchmark (#3465) 4 years ago
  dog-qiuqiu 009d607a15
add the param file of yolo-fastest in benchmark (#3470) 4 years ago
  nihuini 014387dfae
update operators doc 4 years ago
  nihuini de436f9e26
pnnx arange matmul zeros_like expand_as 4 years ago
  nihui 922f8b33c1
reduction4d, merge keepdims arg, add test (#3469) 4 years ago
  nihui bbb450f256
disable tools and examples building in python ci (#3463) 4 years ago
  nihui 713e712ba6
fix slow fp32/int32 crop on arm82 (#3462) 4 years ago
  nihui 2d98c86ecd
branch less mat channel (#3452) 4 years ago
  nihui 3a83704c38
binary4d, unary4d (#3443) 4 years ago
  Zhuo Zhang 880e2805fe
add c906 v223 toolchain (#3449) 4 years ago
  nihuini 42017e4f98
convert pnnx 4d attribute 4 years ago
  nihuini f02b259a15
convert some pnnx reduction family 4 years ago
  nihuini c5efaf02b7
unique attribute identifier 4 years ago
  nihuini 0532190499
convert pnnx expand var 4 years ago
  nihuini e0124db194
convert pnnx clone normal 4 years ago
  nihuini 42e7160950
eliminate pnnx dropout 4 years ago