48 Commits (72ef77a469cd8a8d3cd1dd84d0ba82c6423d7f21)

Author SHA1 Message Date
  nihui a61f03ec76 arm neon optimization for pixelshuffle scale 2 5 years ago
  nihui 5fe75f19ef architecture changes for int8 packing (#2771) 5 years ago
  nihui 21dc650eb3 check layer support (#2564) 5 years ago
  tpoisonooo baf49574c4 innerproduct aarch64 use gemm (#2521) 5 years ago
  nihui 54c0a13b9f build shared library (#2525) 5 years ago
  PENGUINLIONG 8f8f2de4d0 SSE2 optimization pack (#2123) 5 years ago
  maxfy1992 a106baa3b8 add interp param align_corner (#2236) 5 years ago
  Leo 5afd318b86 Support remove libstdc++ denpendency (#2030) 5 years ago
  nihuini d3f0b9f993 try smaller random values 5 years ago
  nihuini 5d5a3d1434 conv1x1s1 conv1x1s2 conv3x3s1 winograd pack8 arm fp16sa 5 years ago
  nihui aa1a9e90c5 interp shufflechannel arm fp16sa pack8 5 years ago
  nihuini df5a7f32d4 enable arm82 fp16sa pack8 test 5 years ago
  nihuini 47ae0c151a some shared arm bf16s fp16s implementation 5 years ago
  nihui bb5bfe3841 avx2 infrastructure (#1943) 5 years ago
  nihui 11cffce114 armv8.2 infrastructure (#1856) 5 years ago
  nihui 3ff40b0679 Ci rv32imc (#1940) 5 years ago
  nihuini 0d6cc01d55 innerproduct handle mish activation, fix naive C testing, fix #1930 5 years ago
  Tijmen Verhulsdonck 3325cf94f8 Added AVX swish/lrn/batchnorm (#1897) 6 years ago
  Tijmen Verhulsdonck 73aa99e83c LSTM arm/x86 + fp16 innerproduct arm (#1881) 6 years ago
  nihui 12ce58074e some code clean 6 years ago
  Tijmen Verhulsdonck 66618340ac x86 fp16 weight storage optimizations (#1871) 6 years ago
  Tijmen Verhulsdonck d1b5711791 X86 Elempack 8 AVX implementations. (#1853) 6 years ago
  nihuini c38d304369 the implicit gpu instance makes life easier :) 6 years ago
  nihui 3ef995ed1e format code style and setup restyled.io (#1840) 6 years ago
  nihuini ebabfa60c1 disable image storage test on macos and ios 6 years ago
  nihui f9332e04e4 enable image storage test (#1744) 6 years ago
  nihui 9a9a618229 image storage is mandatory, less options makes life easier 6 years ago
  nihui e8688b042f fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test 6 years ago
  nihui 62da1228e1 adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihui 18328f63e6 fix arm bf16 test conditions, fix unused warning in crop arm 6 years ago
  nihui 7365bb80a2 vkmat and command api breaks (#1689) 6 years ago
  nihuini 9f3af60b3a dropout prelu scale test 6 years ago
  nihuini 85d5e5d3e4 fix innerproduct vulkan pack8 and arm neon, disable packing_layout for int8 test 6 years ago
  nihui ec40b4dbd7 test bf16s (#1644) 6 years ago
  nihui d023137426 test fp16 packed and shader pack8 option (#1636) 6 years ago
  nihui 2fa22dc2be if layer do not support vulkan, pass the test 6 years ago
  nihuini 648ef3fdee reuse vkallocator in test 6 years ago
  nihui 0f7e7bca02 shader shape specialization constant and basic local group size partition (#1523) 6 years ago
  nihui a718129d76 shader pack8 option works 6 years ago
  nihuini f813222c1a template candy 6 years ago
  tpoisonooo 7168829f06 Fix int8 requant (#1499) 6 years ago
  tpoisonooo d9018b0989 Fix chgemm (#1480) 6 years ago
  nihui 4914fc7a3d pretty test error output 6 years ago
  nihui 3f987808ca add packing test 6 years ago
  nihuini cc006f230a fix test concat crash 6 years ago
  nihuini e25eb87cfd fix concat image vulkan, add concat test 6 years ago
  nihui 6f2ef1932d int8 code refactoring wip, add int8 test 6 years ago
  nihui 038666e049 the initial auto test (#1464) 6 years ago