nihui
ef3550b52f
gru and rnn layer ( #2572 )
5 years ago
Guoxia Wang
13d0bfcfd0
fix onnx2ncnn adaptive pool conversion bug, remove no used weights node ( #2570 )
5 years ago
ncnnnnn
f1c19c1090
pooling_x86_unused_opt ( #2565 )
5 years ago
Guoxia Wang
609f63c57e
support PyTorch AdaptiveAvgPool2d and AdaptiveMaxPool2d ( #2546 )
* support pytorch adaptive pool
* support onnx2ncnn adaptive pool convert
* support ncnnoptimize adaptive pool param write
* fix adaptive pool out_shape order
* fix adaptive pool out_shape order, H and W can be either a int
add test case, set support_vulkan = false Pooling_vulkan::create_pipeline
* review adaptive pool
* fix typo
* add adaptive pool forward in pooling_x86.cpp pooling_arm.cpp
fix out_w, out_h id naming convention
* fix typo
* don't support packing, bf16, int8, image for adaptive pool
* Restyled by clang-format
* Restyled by astyle
* Restyled by clang-format
* Restyled by astyle
Co-authored-by: Restyled.io <commits@restyled.io>
5 years ago
Zhuo Zhang
aac39e917f
fix mingw-x64 compilation for PC SIMD ( #2554 )
5 years ago
tpoisonooo
baf49574c4
innerproduct aarch64 use gemm ( #2521 )
* perf(innerproduct-arm): add aarch64 gemm
* fix(innerproduct): fix compilation errror
* fix(armv7-innerproduct): fix armv7 compilation error
* fix(innerproduct): fix gemm param
* fix(int8): update mock scales and fix runtime error
* fix(compilation): fix compilation error
5 years ago
nihui
54c0a13b9f
build shared library ( #2525 )
* build shared lib and enable lto
* reserved for layer and option
* allocator pimpl
* datareader pimpl
* paramdict pimpl, disable copy assign for allocator and datareader
* modelbin pimpl
* net extractor pimpl
* gpu pimple
* disable copy assign vulkandevice, code format
* command pimpl, dummy image readonly
* pipeline pipelinecache pimpl, export platform class
* code format, export simple family
* update ci
* disable lto on android armv7, merge webassembly ci
* link libgcc, fix macos dylib version
* pipeline pimpl, gpu info pimpl
* destroy gpu info after vulkan device
* ignore msvc stl class warning
* fix ncnn_paramdict_get_float return type
* fix vktransfer upload fp16 without flatten, add command test
5 years ago
nihuini
fbf0ffda53
pixelshuffle nhwc mode, convert onnx DepthToSpace mode DCR, convert mlir tf.DepthToSpace
5 years ago
nihuini
b35b06be6d
reorg nhwc mode, code format
5 years ago
nihui
82c4acc187
conv1x1s1 and packing pack4 x86 optimization, fix #2510 fix #2509
5 years ago
nihui
c324149709
ci vs2015 ( #2492 )
* add vs2015 ci
* build x86 binary
5 years ago
nihuini
bd4f1ccb07
eltwise for vec and image, fix #2473
5 years ago
Zhuo Zhang
a1e9993616
fix convolution_arm.cpp shadowed variables warning ( #2448 )
5 years ago
Zhuo Zhang
a3642fd4a9
fix convolution.cpp shadowed variables warning ( #2449 )
5 years ago
Zhuo Zhang
782252f474
fix eltwise_x86*.cpp-shadowed-variables-warning ( #2445 )
5 years ago
Zhuo Zhang
f13035794a
fix convolution_x86*.cpp-shadowed-variables-warning ( #2444 )
5 years ago
nihuini
1a3191e245
fix libncnn build with gcc-4.8 and gcc-4.4, fix #2388
5 years ago
ncnnnnn
0fa902bcf8
remove vs c4305 ( #2423 )
5 years ago
zhiliu6
25b224479c
optimize left over x86 convolution ( #2378 )
5 years ago
nihui
a071637064
optional sse2 ( #2373 )
5 years ago
Zhuo Zhang
e07faf3403
change abs() to fabs() for float args on ARM ( #2374 )
For compatiblity for old compilers such as ndk-r16b.
`fabs()` keeps result stable.
ref: https://github.com/Tencent/ncnn/issues/2372
5 years ago
nihuini
3fcd44cf99
fix interp on vec
5 years ago
nihui
be49c07e93
fix arm82 fp16s crop padding
5 years ago
nihui
e68f15d2f0
padding vulkan vec and image, more padding test
5 years ago
Leighton Choi
44518f457a
Support negative axis in concat, slice and softmax ( #2365 )
5 years ago
nihuini
2c02bfb567
crop vulkan vec and image, crop x86 pack4, more crop tests, fix crop with channel tail offset
5 years ago
nihui
ba364791e8
reshape x86 pack4
5 years ago
nihui
46565025b6
x86 slice pack4
5 years ago
nihuini
fb59eeb4c2
fix bias x86 sse2 code style
5 years ago
nihui
b896c4c33c
add some x86 pack4
5 years ago
PENGUINLIONG
8f8f2de4d0
SSE2 optimization pack ( #2123 )
* SSE2: BatchNorm
* Fixed batch norm in AVX configuration
* Optimized register size switch
* Attempt to pass CI
* Attempt to pass CI
* Bias op
* Element wise ops
* Support packing on x86 by default
* Fixed macro range in bias
* Use aligned read for packed data
* Update testutil.h
* Update pooling_x86.cpp
* Support wasn SIMD
* Fix emscripten compiler flags
* fix build
* more ci fix
* concat x86 pack4
* flatten x86 pack4
* more x86 pack4
* ci pass
* fix
* enable sse2 mathfun
* enable --experimental-wasm-simd
Co-authored-by: nihui <shuizhuyuanluo@126.com>
Co-authored-by: nihuini <nihuini@tencent.com>
5 years ago
maxfy1992
a106baa3b8
add interp param align_corner ( #2236 )
* add interp param align_corner
add check support_vulkan after create_pipeline for tests
* code style
Co-authored-by: yangfengmax <yangfengmax@didichuxing.com>
5 years ago
Philipp Zabel
2b4b7cabe8
Fix maybe uninitialized warnings in Vulkan reshape functions ( #2255 )
5 years ago
Evgeny Proydakov
9dbe7aa897
Fixed compile warning for ios clang build. [-Wuninitialized] ( #2244 )
5 years ago
nihui
cf3cf83cd3
unified image shader storage type ( #2231 )
* drop bug_layout_binding_id_alias flag
5 years ago
Evgeny Proydakov
a0c9249ffe
Fixed compile warning in convolutiondepthwise for ios cpu 32 bit. [-Wunused-variable] ( #2227 )
5 years ago
Evgeny Proydakov
2cfec9032b
Fixed compile warnings for ios clang cpu 32bit. [-Wunused-variable] ( #2223 )
5 years ago
Evgeny Proydakov
6a71ccfc3f
Fixed compile warnings for clang ios gpu. [-Wunused-variable] ( #2217 )
5 years ago
Evgeny Proydakov
f1dc8a1825
Fixed compile warnings for gcc linux gpu 64 build. [-Wunused-parameter] ( #2216 )
* Fixed compile warnings for gcc linux gpu 64 build. [-Wunused-parameter]
* Fixed linux x64 gpu clang compile warning. [-Wunused-variable]
5 years ago
Zhuo Zhang
fc9541a2b5
fix packing.cpp integer multiply may overflow ( #2213 )
* fix packing.cpp integer multiply may overflow
* int->size_t
5 years ago
Evgeny Proydakov
2b66348b62
Fixed compile warnings for gcc linux gpu 64 build. [-Wunused-parameter] [-Wunused-variable] ( #2215 )
5 years ago
Zhuo Zhang
8f7937affa
fix shufflechannel.cpp integer multiply may overflow ( #2212 )
5 years ago
Zhuo Zhang
961af15089
fix roialign_x86.cpp integer multiply may overflow ( #2211 )
5 years ago
ncnnnnn
5a91a640cb
fix -Wmaybe-uninitialized _coffset ( #2204 )
5 years ago
restyled-io[bot]
2d459830a0
Restyle fix -Wmaybe-uninitialized ( #2203 )
* fix -Wmaybe-uninitialized
build warning: ‘_outh’ '_outw' may be used uninitialized
* Restyled by clang-format
* Restyled by astyle
* Restyled by clang-format
* Restyled by astyle
Co-authored-by: ncnnnnn <67086033+ncnnnnn@users.noreply.github.com>
Co-authored-by: Restyled.io <commits@restyled.io>
5 years ago
fawdlstty
1d1cb29869
Fixed compile warning due to default cast ( #2201 )
5 years ago
Evgeny Proydakov
bc2471208f
Fixed compile warnings for gcc linux build. [-Wunused-parameter] [-Wunused-but-set-variable] ( #2198 )
5 years ago
nihui
b9296c259d
bring up vulkan 1.1 ( #2191 )
* query subgroup features
* compile spirv 1.3
* drop offline spirv build
* do not build tests for android and ios, as they are never tested anyway
* code style
5 years ago
Evgeny Proydakov
7dfb1cbafc
Fixed compile warning for x86 layer with reason: [-Wunused-variable] ( #2186 )
5 years ago
Evgeny Proydakov
478dbd546a
Fixed compile warning for x86 layer. [-Wunused-variable] ( #2184 )
5 years ago