102 Commits (bedf00a5edf85ae8af33bc72ebd85fe325da1271)

Author SHA1 Message Date
  nihui 8f1727ace8 more code coverage 5 years ago
  nihui bb5bfe3841
avx2 infrastructure (#1943) 5 years ago
  nihui 11cffce114
armv8.2 infrastructure (#1856) 5 years ago
  nihui 3ff40b0679
Ci rv32imc (#1940) 5 years ago
  nihui fe6bc1ed4d
Ci rv64gcv and rv64gc (#1936) 5 years ago
  nihui 6538b95102 fix interp and lrn test 5 years ago
  nihuini 2cd8e4d0fa fix floor ceil test with very small numbers 5 years ago
  nihuini c218eee1e0 fix binaryop test ooops 5 years ago
  nihuini 9a38962be2 much longer test ... 5 years ago
  nihuini 0d6cc01d55 innerproduct handle mish activation, fix naive C testing, fix #1930 5 years ago
  nihui be330e0fc4 test mat pixel 5 years ago
  nihuini 5d7b410ca8 test squeezenet with memory pool 5 years ago
  nihui 109e079c51 test deconvolution with output shape and padding 5 years ago
  nihuini 00ef566609 implement full permute tag support for reshape 5 years ago
  nihui 0fdd432fb3
Ci test squeezenet load from binary model and load from memory (#1928) 5 years ago
  nihui 7f5047d1dc
Ci test end2end squeezenet (#1919) 5 years ago
  nihui fb4daa5c96 reshape packing with permute, fix #1909 5 years ago
  Tijmen Verhulsdonck 3325cf94f8
Added AVX swish/lrn/batchnorm (#1897) 5 years ago
  Tijmen Verhulsdonck 73aa99e83c
LSTM arm/x86 + fp16 innerproduct arm (#1881) 5 years ago
  Tijmen Verhulsdonck 26999fab19
Fix AVX wino 3x3 and improve convolution test converage (#1891) 5 years ago
  nihui 12ce58074e some code clean 5 years ago
  Tijmen Verhulsdonck 66618340ac
x86 fp16 weight storage optimizations (#1871) 5 years ago
  Tijmen Verhulsdonck 82637995c1
3x3 winograd elempack8 (#1865) 5 years ago
  nihuini 71db6e1da5 shufflechannel reverse group style 5 years ago
  Tijmen Verhulsdonck d1b5711791
X86 Elempack 8 AVX implementations. (#1853) 6 years ago
  nihuini c38d304369 the implicit gpu instance makes life easier :) 6 years ago
  Tijmen Verhulsdonck a91a18b901
AVX innerproduct and pooling 2x2 versions (#1839) 6 years ago
  nihui 3ef995ed1e
format code style and setup restyled.io (#1840) 6 years ago
  Tijmen Verhulsdonck e3b31511ad
Added AVX implementation to cast to/from bfloat and float32 (#1836) 6 years ago
  Tijmen Verhulsdonck da09e5e7f1
Adding channel padding support for blazeface model. (#1826) 6 years ago
  JackieWu ce2251db05
Improve ROIAlign (accelerate ROIAlign, support sampling ratio and aligned ROIAlign) (#1820) 6 years ago
  nihui 8fec0038ba fix ci test 6 years ago
  nihuini 4a624c636b skip image tests on unsupported platforms 6 years ago
  zhiliu6 d23cef320c
Add Swish layer (#1799) 6 years ago
  nihui 15a4b2c878 test pad same mode 6 years ago
  nihuini 0efcf63f51 mat pixel rotate test 6 years ago
  nihuini ebabfa60c1 disable image storage test on macos and ios 6 years ago
  zhiliu6 3bfabf1d6a
Add fused convolution and mish layer support. (#1761) 6 years ago
  nihui 9c0e46b00a priorbox test fix 6 years ago
  nihui 52c2782922 priorbox test 6 years ago
  nihuini f350c96112 memorydata vulkan 6 years ago
  nihui f9332e04e4
enable image storage test (#1744) 6 years ago
  nihuini b9e9b99e56 reuse device packing and unpacking, noop test, fix packing test 6 years ago
  nihui 9a9a618229 image storage is mandatory, less options makes life easier 6 years ago
  nihui e8688b042f fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test 6 years ago
  zhiliu6 bd55ddcf0d
Add mish layer (#1733) 6 years ago
  nihui 62da1228e1
adreno image shader + fp16 + fp16a (#1714) 6 years ago
  nihuini b2d9325c0d test activation fusion 6 years ago
  nihuini 956ab49d02 fix conv1x1s1 pack4to1 bf16s 6 years ago
  nihuini 36f6942fa0 testing time is too long ... 6 years ago