nihui
|
2abeb038a0
|
unroll outch for conv3x3s2, about 30% faster :)
|
8 years ago |
nihui
|
97025668a9
|
unroll conv1x1s1 outch 6 inch 4 on armv7, about 2%~18% faster
|
8 years ago |
nihui
|
0a666c8cb9
|
Update README.md
|
8 years ago |
nihuini
|
dd0ae756de
|
batchnorm and scale on vector and image, fix #331
|
8 years ago |
AlanNewImage
|
e77eef3f3d
|
Update priorbox.cpp (#330)
|
8 years ago |
daquexian
|
3a5d7cfcce
|
Fix mobilenet v2's stride (#327)
|
8 years ago |
Yantao Xie
|
cd3617b11d
|
Set ArgMax's one_blob_only as true. (#325)
|
8 years ago |
Joe
|
9748c00b44
|
add image-level feature support (#320)
* add image-level feature support
* move special case out
* tab to space
|
8 years ago |
nihui
|
0e41c37250
|
convert convolution pad_w pad_h
|
8 years ago |
Tiancai Ye
|
ea95c7a7fc
|
fix a fix~~~ (#323)
|
8 years ago |
Tiancai Ye
|
3977d32eb9
|
Fix windows build fails (#321)
* fix windows build error
* remove wrong commit
|
8 years ago |
Yantao Xie
|
73340578c8
|
Remove the destructor definition from the lstm layer. (#319)
|
8 years ago |
Howave
|
5e7332e507
|
align memory start address (#318)
* make memory start address 4bytes aligned
* align memory start address for MSVC
|
8 years ago |
nihuini
|
fac262658c
|
use android prebuild releas folder, build arm64 jni library
|
8 years ago |
nihuini
|
4019505642
|
use clang, fix build, fix #292
|
8 years ago |
nihuini
|
e4c1ddbc45
|
rewrite inner loop in assembly, since gcc is sometimes foolish qaq, fix #312
|
8 years ago |
nihuini
|
aac70893f8
|
fix build on gcc
|
8 years ago |
nihuini
|
394bca8dbb
|
Merge branch 'master' of https://github.com/Tencent/ncnn
|
8 years ago |
nihuini
|
9ac305e160
|
create 3-dim sub blob for group convolution, fix #315
|
8 years ago |
Howave
|
415bfbdfa7
|
added arm layer compilation for arm-linux system (#316)
|
8 years ago |
nihuini
|
318d3abe66
|
bind register explicitly, fix #306, fix #310, fix #312
|
8 years ago |
Yantao Xie
|
2e9da1b95b
|
Add the epsilon parameter to the BatchNorm layer. (fix #303) (#311)
* Add the epsilon parameter to the BatchNorm layer. (fix #303)
* Move the eps into the sqrt.
|
8 years ago |
nihuini
|
231a52e469
|
fix build on aarch64 with gcc, fix #309
|
8 years ago |
BUG1989
|
4ebab2725d
|
Updata benchmark ReadMe (#308)
|
8 years ago |
BUG1989
|
af7019d3fc
|
fix compile error (#305)
|
8 years ago |
nihui
|
d7e31987fa
|
Update README.md
|
8 years ago |
nihui
|
004632d9e5
|
Update README.md
|
8 years ago |
nihui
|
ce9eeaba9a
|
Update README.md
|
8 years ago |
nihui
|
875a188d10
|
pre interleave kernel memory for winograd4, about 3%~20% speed gains
|
8 years ago |
nihui
|
d2c01019aa
|
fix convert depthwise deconvolution, fix #300
|
8 years ago |
dong
|
6ea09ebf2c
|
Use aarch64 assembly to replace arm intrinsics
|
8 years ago |
nihui
|
0fe4c6a757
|
Update README.md
|
8 years ago |
820169199
|
656de48631
|
add "#include <float.h>"
|
8 years ago |
Dong Xu
|
28154dcb29
|
fix vst1.f32 of coeff sum at eltwise_arm layer
In line 414: "vmla.f32 q1, q0, %q6 \n", destination register is q1 instead of q0, So, replace the {d0-d1} of line 416 with {d2-d3}.
|
8 years ago |
nihui
|
57f89a0245
|
convert MatMul
|
8 years ago |
nihui
|
4d6fa6cc79
|
convert Constant
|
8 years ago |
nihui
|
875a042da8
|
convert LRN bias
|
8 years ago |
nihui
|
0fd701112e
|
load LRN bias from param
|
8 years ago |
nihui
|
7d1e49584d
|
call Innerproduct for convolution on flattened blob
|
8 years ago |
nihui
|
caf105abc5
|
convert BinaryOp
|
8 years ago |
harhar539
|
9a8486a823
|
1.fix pad tail bug in commit d1ea2a3 at pooling layer
|
8 years ago |
nihui
|
b1aec69ff9
|
d31 is useless
|
8 years ago |
nihuini
|
5e484a47ef
|
fix build, second try
|
8 years ago |
nihui
|
5f0fa95f61
|
fix build
|
8 years ago |
nihui
|
9c05e48e87
|
Update README.md
|
8 years ago |
nihui
|
ecaadb20c6
|
fix result blob in squeezenet example
|
8 years ago |
nihui
|
68f016936d
|
convert Dropout Sum and InnerProduct-like Gemm, inception_v1 works :P
|
8 years ago |
nihui
|
d1ea2a34b4
|
rewrite pooling pad scheme, global pooling return continous blob
|
8 years ago |
nihui
|
5cdcf33cfc
|
convert softmax, squeezenet model works :D
|
8 years ago |
nihui
|
a335ae840f
|
add mobilenet v2 result
|
8 years ago |