190 Commits (49f3e1ea098f6d2930a0f9f980bd23bfdfd9fe02)

Author SHA1 Message Date
  nihui 49f3e1ea09 drawing api and stb_image (#2913) 5 years ago
  nihui 17936e9f54 fix packing risc-v test, add cpu_riscv_vlenb() 5 years ago
  nihui a61f03ec76 arm neon optimization for pixelshuffle scale 2 5 years ago
  nihuini d6b2ea5aac arm neon optimization for convolution 3x3 on small channels 5 years ago
  nihui 7e1aaa5828 cmake option NCNN_INT8 (#2839) 5 years ago
  nihui 66455c1b95 implement 2823 binary broadcasting type (#2827) 5 years ago
  nihuini 41a4bea954 unroll size 8 for conv3x3s1 pack8to1 int8 arm64 5 years ago
  nihui e9cc637573 arm neon optimization for int8 packing kernels (#2809) 5 years ago
  nihui 1ea8bfbd2e x86 avx2 conv3x3s1 pack8 direct optimization, fix #2789 5 years ago
  ncnnnnn 6e6cb9f4f3 simple sort ncnn_add_layer_test (#2790) 5 years ago
  nihui a48bf43ef7 test conv/fc int8 with activation 5 years ago
  nihui 5fe75f19ef architecture changes for int8 packing (#2771) 5 years ago
  nihuini 15d63ec0f5 fuse onnx multiheadattention with same qkv blob 5 years ago
  RBelogorodtsevFBase 1212ed6e94 implements gelu activation (#2749) 5 years ago
  nihuini c17eb4e208 multiheadattention layer 5 years ago
  nihuini 7ac23ab34d fuse onnx layernorm, fix 2-dim layernorm implementation, add test 5 years ago
  nihui 3c92a1184b arm neon optimization for general convolution im2col sgemm (#2668) 5 years ago
  nihui ab56083ca5 arm neon optimization for conv3x3s1 winograd42 (#2664) 5 years ago
  nihuini f437bcdd4c enable fp16s and int8s on newer adreno/mali, actually enable int8 tests 5 years ago
  nihui 74451897cb handle gemm in innerproduct (#2607) 5 years ago
  nihui 0a59ac9b16 integer warpaffine (#2604) 5 years ago
  nihui 6672b09a37 arm neon optimization for gru (#2597) 5 years ago
  nihui 0b35540c72 arm neon optimization for lstm (#2595) 5 years ago
  nihuini 3915b5d496 arm neon optimization for packing fp16/bf16 pack8 family 5 years ago
  nihui fca04980f3 enhance padding test (#2580) 5 years ago
  nihui 80fdddb502 more slice test 5 years ago
  nihui ef3550b52f gru and rnn layer (#2572) 5 years ago
  Guoxia Wang 609f63c57e support PyTorch AdaptiveAvgPool2d and AdaptiveMaxPool2d (#2546) 5 years ago
  nihui 21dc650eb3 check layer support (#2564) 5 years ago
  tpoisonooo baf49574c4 innerproduct aarch64 use gemm (#2521) 5 years ago
  nihui 54c0a13b9f build shared library (#2525) 5 years ago
  nihuini fbf0ffda53 pixelshuffle nhwc mode, convert onnx DepthToSpace mode DCR, convert mlir tf.DepthToSpace 5 years ago
  nihuini b35b06be6d reorg nhwc mode, code format 5 years ago
  nihui 1040f40c8b update c api for custom allocator datareader modelbin and layer registration, add cookie userdata to layer 5 years ago
  nihui 2b7b92b726 update c api allocator 5 years ago
  nihui 017440c1ca update c api allocator 5 years ago
  nihuini 27e9795198 update c api 5 years ago
  nihuini 4114d333c9 low-level op api for C api 5 years ago
  nihuini bd4f1ccb07 eltwise for vec and image, fix #2473 5 years ago
  Evgeny Proydakov cce6b59556 More test for cpu (#2439) 5 years ago
  nihuini 3fcd44cf99 fix interp on vec 5 years ago
  nihui be49c07e93 fix arm82 fp16s crop padding 5 years ago
  nihui e68f15d2f0 padding vulkan vec and image, more padding test 5 years ago
  Leighton Choi 44518f457a Support negative axis in concat, slice and softmax (#2365) 5 years ago
  nihuini 27d36cc804 crop test for reference blob 5 years ago
  nihuini 2c02bfb567 crop vulkan vec and image, crop x86 pack4, more crop tests, fix crop with channel tail offset 5 years ago
  nihui 71272d1b99 more padding test 5 years ago
  PENGUINLIONG 8f8f2de4d0 SSE2 optimization pack (#2123) 5 years ago
  Leo cab255f107 Fix compile failure when NCNN_PIXEL off (#2260) 5 years ago
  Evgeny Proydakov a9cd60a995 Added unittest for cpu module. Improved code coverage. (#2311) 5 years ago