nihuini
|
4e3df863d5
|
fix enable feature pointer
|
7 years ago |
nihuini
|
46dc21c8b1
|
fp16 shader
|
7 years ago |
Gemfield
|
add8c73922
|
Fix the return value of load_param and load_model (#855)
|
7 years ago |
nihuini
|
37573aeeb5
|
remove unused record download
|
7 years ago |
nihuini
|
05bf09ba70
|
rename fp16_storage to support_fp16_storage
|
7 years ago |
nihuini
|
43737b378f
|
wrapper function for converting between fp32 and fp16
|
7 years ago |
Gemfield
|
506d9f74aa
|
eliminate the warning message when compiling onnx2ncnn (#850)
|
7 years ago |
nihuini
|
2b8ff843e9
|
cast layer and shader for fp32 fp16 conversion
|
7 years ago |
Gemfield
|
573c2bcd93
|
Fix crash issue during load_model (#848)
* Fix crash issue during load_model
* Fix crash issue during load_model 2nd part
|
7 years ago |
nihuini
|
a3a2548aa2
|
initial fp16s fp16a shader build system
|
7 years ago |
nihuini
|
332722af63
|
fix fp16a int8a exchange oops
|
7 years ago |
nihuini
|
e59dc6fafe
|
proper usage of instance extension VK_KHR_get_physical_device_properties2, check fp16 and int8 feature
|
7 years ago |
Gemfield
|
d91d7419eb
|
Add upsample linear mode (#846)
|
7 years ago |
nihui
|
caeb85d6cd
|
multithreaded pipeline creation and destruction may cause driver crash :(
|
7 years ago |
Gemfield
|
5407deb5ce
|
Apply splitcnn function to graph input (#844)
|
7 years ago |
nihui
|
dbf20520a2
|
Update README.md
|
7 years ago |
nihuini
|
20fb006282
|
coverage never works without proper unittest
|
7 years ago |
Abdel Younes
|
e9ac5f207f
|
add: cmake option to install NCNN SDK (#841)
A project that uses src/CMakeLists.txt directly may not want
to install the NCNN library and headers. This new option makes
installation optional (defaults to true).
|
7 years ago |
Gemfield
|
582ea371ce
|
Enhance upsample scale value retrieval with raw_data API (#839)
|
7 years ago |
nihuini
|
3b404dcedd
|
fuse transpose(weight)+matmul to matmul, fix #620
|
7 years ago |
nihuini
|
b2e41bf83d
|
fallback convolution to cpu path for pad -233
|
7 years ago |
nihuini
|
d933f384b6
|
bump engine version
|
7 years ago |
nihuini
|
038389fa63
|
blacklist known buggy driver
|
7 years ago |
nihuini
|
593715da88
|
update new cctools-port path
|
7 years ago |
nihuini
|
92f221b554
|
update script for vulkan-enabled build, add ios 64bit-only toolchain file
|
7 years ago |
nihuini
|
c1c72ec1b7
|
add gpu button for squeezencnn sample, require api android-24 and ndk-r18b
|
7 years ago |
nihuini
|
d999f43b87
|
fix vulkan initialization using memory loading
|
7 years ago |
nihuini
|
d263cd507c
|
gpu packing and unpacking
|
7 years ago |
nihuini
|
806911a549
|
packing vec and image shader
|
7 years ago |
BUG1989
|
2f4c4a8202
|
fix the compile error when using armv7a without neon (#835)
|
7 years ago |
nihuini
|
6f9ffca7e3
|
fix crop on channel dim only, fix #797, fix #831
|
7 years ago |
nihuini
|
e811778a4a
|
trivial out-of-index fix
|
7 years ago |
nihuini
|
76638aebf9
|
fix build on msvc
|
7 years ago |
nihuini
|
f89c5b9c31
|
convert mxnet UpSampling, convert onnx Upsample, fix #753
|
7 years ago |
nihuini
|
d9301c4f59
|
convert mxnet crop slice step1, convert onnx slice step1, fix reduction dims 2, fix #441, fix #498, fix #519
|
7 years ago |
BUG1989
|
ff38053321
|
[WIP] arm64-v8a int8 optimization (#823)
* requantize layer arm64-v8a neon implementation
* convdw3x3s1 arm64-v8a neon implementation
* convdw3x3s2 arm64-v8a neon implementation
* conv1x1s1 arm64-v8a is optimized with neon assembly
* conv sgemm int8 optimized with neon assembly, kernel transform is offline
* conv winograd int8 optimized with neon assembly, fix ci build failure
* conv3x3s2 int8 arm64-v8a optimized with neon assembly, remove old code
|
7 years ago |
nihuini
|
d3a11eb6c9
|
one codepath for unified and discrete device
|
7 years ago |
nihuini
|
433a92401a
|
auto barrier in pipeline and copy command
|
7 years ago |
nihuini
|
2672cd437f
|
add layer type index member
|
7 years ago |
黄涛
|
9f0ae5c779
|
fixed op <InstanceNormalization> (#829)
|
7 years ago |
nihuini
|
26e64cac41
|
use number instead of enum for compatibility with opencv 2.x and opencv 4.x
|
7 years ago |
nihuini
|
ce65edcc84
|
fix flatten pack1to4
|
7 years ago |
nihuini
|
5646b7d2c2
|
flatten image
|
7 years ago |
nihuini
|
1f4bdd91b5
|
uint32_t typed workgroup size
|
7 years ago |
nihuini
|
2e939fab0f
|
fix memleak
|
7 years ago |
nihuini
|
532054b453
|
expose more device info
|
7 years ago |
nihui
|
dd83284cee
|
prelu shader
|
7 years ago |
nihui
|
e50b339f04
|
clip shader
|
7 years ago |
nihui
|
69788b0467
|
reshape shader family
|
7 years ago |
nihui
|
c41bcd98a3
|
priorbox shader, fix permute order 1 on image, fix potential staging memory leak
|
7 years ago |