nihui
7655b9e4e9
fix build on armv7 again ...
6 years ago
nihui
a97439988f
fix build on armv7
6 years ago
nihui
c013bd9b7e
vulkan convolution winograd f63
6 years ago
nihuini
81a5dfe76b
general convolution and convolutiondepthwise arm neon pack4, wip
6 years ago
nihuini
834224fea8
new option use_packing_layout
6 years ago
Natsu
6d1944f2c3
CMake improvement ( #1115 )
* CMake improvement
* Fix bugs
* Fix typo
* Propagate vulkan dependency
* import vulkan
* add config files, now exported target cmake should be able to find packages
* Propagate no-rtti and no-exception
* Provide an option to control rtti and exception in mobile platform
* Make cmake clean
* Resolve conflicts
* Update CMake
PIE is propagated by INTERFACE_POSITION_INDEPENDENT_CODE
* Remove bad things
6 years ago
nihuini
eced9c81c6
fix crash on x86 conv7x7s1 dilation > 1, fix #1110 , fix #1117
6 years ago
tpoisonooo
1ca4387c9c
Auto choose conv implementation ( #1085 )
* add relative README_CN.md;
* obtain time cost with op->forward().
6 years ago
nihuini
e9c890ad8a
respect winograd top blob allocator when out size is exactly the one before cut border, fix #1118
6 years ago
wincle
fb9309e460
implement reflect padding.
6 years ago
wincle
d1a1a0dbcd
fix onnx2ncnn padding bug. fix #1103
6 years ago
wincle
bb6b93568d
Tanh Support
6 years ago
Arnaud Brejeon
e30464cdba
Fix bug #1093 ( #1094 )
6 years ago
nihui
a7490fd71a
fix #1107
6 years ago
nihui
8c53706987
net vkdev getter api
7 years ago
Christopher
6cfd09b429
add toolchains hisiv600toolchain for hi3559V100 ( #1090 )
7 years ago
nihui
5ce6ae90be
a portable thread class ( #1081 )
* a portable thread class
* Update platform.h.in
* Update platform.h.in
7 years ago
daquexian
dbe8a32082
Add onnx resize op
7 years ago
nihuini
c769437533
fix fp16p deconvolution and convolution-typed innerproduct
7 years ago
xue
cb14d1bbf3
fix out-of-bounds convolution bug on x86 when sse is not used
7 years ago
hanson.young
698acd28f6
Fix AVX2 compilation issue on Windows x64 ( #1076 )
* fix windows avx2 error in convolution_3x3.h and convolution_sgemm.h
* fix compatibility of avx2 under Linux and Windows
* Update CMakeLists.txt
Space instead of tab
* Update convolution_3x3.h
Space instead of tab
* Update convolution_sgemm.h
Space instead of tab
7 years ago
nihuini
6b933c6dc4
a portable condition variable class
7 years ago
BUG1989
bcfe9f453f
initial ncnn post-training quantization tools ( #1067 )
* initial ncnn post-training quantization tools
* clear some comments of tools
* fix the Travis ci compiler error
7 years ago
nihuini
b25f76833a
restore per extractor allocator setters, partially revert e09607bc22
7 years ago
nihuini
21b5508c96
shared locked vkallocator cannot prevent concurrent access during actual gpu inference, use separate vkallocator for each queue
7 years ago
nihui
7e289bbd23
Update README.md
7 years ago
BUG1989
06a63467e9
update qcom675 benchmark ( #1062 )
7 years ago
BUG1989
bb0d8360dc
update RK3399 benchmark ( #1059 )
7 years ago
BUG1989
ae1b69ec53
update some comments of int8 contributors ( #1054 )
7 years ago
Howave
0bd8ba9505
update Snapdragon 835 benchmark (Xiaomi 6 with non performance mode) ( #1055 )
7 years ago
Jingdong Li
f7e4155ee3
implemented selu activation ( #1051 )
7 years ago
nihuini
082aa2489d
packing padding support packing
7 years ago
nihuini
4d5c6a00fb
absval batchnorm clip relu sigmoid support packing
7 years ago
nihuini
6935b78926
new layer attribute support_packing
7 years ago
nihuini
51ef60e71b
fix hardsigmoid
7 years ago
Howave
fed22ca4ed
fix neon intrinsic compile warnings ( #1045 )
7 years ago
PENGUINLIONG
084053fed8
Implemented hard sigmoid ( #1046 )
* Implementation for hard sigmoid
Not yet tested.
* Resolve requests
7 years ago
nihui
affac77b6f
add info for brocolli
7 years ago
Howave
123ca35e00
fix compile warnings ( #1042 )
7 years ago
nihuini
bade132589
comment++
7 years ago
nihuini
81be8c86ae
fix bus error in resize_bilinear_c2 on armv7
7 years ago
nihuini
17d63a1491
fix bus error in resize_bilinear_c3 on armv7
7 years ago
nihuini
e9ffdb5bdd
16bit storage on arm mali is buggy
7 years ago
nihui
1273d69c20
update qcom410 imx7d benchmark
7 years ago
nihuini
73911492d7
fix validation warning on querypool destruction, enable fp16p by default
7 years ago
nihuini
040a8d2427
set vulkan device by gpu index
7 years ago
nihuini
9f9ac56538
update qcom810 and iphone5s benchmark
7 years ago
nihui
21f79b8546
prefer cpu fp16 casting to reduce upload/download overhead on discrete gpu
7 years ago
nihui
af950819cd
convert add_n and ElementWiseSum, fix #1008
7 years ago
nihui
721abe91a8
packed mat is handy
7 years ago