nihui
9376ba71c1
less unroll for unaryop arm, fix padding arm warning
4 years ago
nihui
7886e90c65
split arm82 source for smaller binary and memory footprint ( #3877 )
* split arm82 source, wip
* check compiler arm82 only for arm 64bit target
* drop arm82 registery
* strict check compiler support arm82
4 years ago
nihui
426e564b6e
general simd optimization for convolution1d ( #3404 )
4 years ago
nihui
f10cc6dd93
initial data structure changes for 3dcnn, conv3d, pooling3d ( #3378 )
Co-authored-by: ElvisYu <elvisyuovo@gmail.com>
Co-authored-by: 余浩文 <m18107220188@163.com>
Co-authored-by: Zr2223 <67497651+Zr2223@users.noreply.github.com>
4 years ago
nihui
cdf45a6512
cmake option NCNN_BF16 ( #3068 )
4 years ago
nihui
5fe75f19ef
architecture changes for int8 packing ( #2771 )
* quantize and dequantize tests
* unify activation and usability function
* drop NCNN_REQUANT cmake option, test dequantize requantize pack8, fix webassembly build
* benchmark use requantize int8 model
5 years ago
nihui
be49c07e93
fix arm82 fp16s crop padding
5 years ago
nihui
e68f15d2f0
padding vulkan vec and image, more padding test
5 years ago
nihuini
dbb761b9a4
binaryop eltwise padding pooling arm fp16sa pack8
5 years ago
nihuini
f23122bb3f
since fp16 storage option is on by default, upper-level function may pass fp32 storage with default option, guard with element bits checking
5 years ago
nihuini
47ae0c151a
some shared arm bf16s fp16s implementation
5 years ago
nihui
b5e288b521
layer creator function is not necessary for built-in layers
5 years ago
nihui
3ef995ed1e
format code style and setup restyled.io ( #1840 )
6 years ago
Tijmen Verhulsdonck
da09e5e7f1
Adding channel padding support for blazeface model. ( #1826 )
* Add channel padding and blazeface model support.
* remove python binding
* remove std::min usage
* fix reference blob usage
* Increased padding test coverage
* implement requested changes
6 years ago
nihui
b7c82fcc45
code clean, concat bf16s
6 years ago
nihui
7d1eec3d5d
the use_bf16_storage option
6 years ago
nihui
2c3b70f8b2
else if is ugly :D
6 years ago
nihui
d599791f59
padding pack4 bf16s neon kernel
6 years ago
nihui
5042d14d7d
define sfpvec8 afpvec8 macro, use modern glsl extension for fp16 arithmetic, fix padding aarch64 build
6 years ago
nihui
ebed350c48
add flatten test, fix build on armv7
6 years ago
nihui
4cd80f6636
fix padding arm pack4 per_channel_pad
6 years ago
nihuini
c78b8a08fd
padding reflect pack4, add padding test
6 years ago
nihuini
a50bcf10aa
per channel pad
6 years ago
nihui
a1bd88fb4a
arm neon assembly optimization for padding constant pack4
6 years ago
nihui
02c811e829
fix padding pack4 elempack
6 years ago
nihui
394f6786b9
neon enable support_packing
6 years ago
nihuini
c4f23ae8ad
rename Mat packing to elempack
6 years ago
nihuini
082aa2489d
packing padding support packing
7 years ago