Diego Gomes
|
837e6b047e
|
Rasp bench (#531)
* add a benchmark results for raspberry model 3b+
|
7 years ago |
BUG1989
|
1b0e33460d
|
add armv7 int8 conv3x3s1,using vaddw to replace vadd and vmovl
|
7 years ago |
nihui
|
72411b7a6c
|
restore the old conv3x3s2 as reference, fast dilation convolution fails on striding
|
7 years ago |
nihui
|
1f20eb4e8c
|
pack weight and more unroll makes improvement, ~20% faster for conv3x3s2
|
7 years ago |
chensy
|
30cc738309
|
fix asm "invalid operand" error for target iOS armv7 on file dequantize_arm.cpp
|
7 years ago |
Diego Gomes
|
4d73407df8
|
fix gettid call for glibc
|
7 years ago |
Diego Gomes
|
534f38ed87
|
fix auxv read for elf64
|
7 years ago |
nihuini
|
2dbaf6f7b7
|
store int8 scale in binary
|
7 years ago |
nihui
|
cebded134a
|
enable pool allocator in sample project, display unscaled image
|
7 years ago |
nihui
|
fe14037777
|
more sub op preload
|
7 years ago |
nihui
|
2fe7ada4d8
|
add arm int8 convolution stub, preload group op for x86
|
7 years ago |
nihui
|
eac7c66a97
|
fix fp32 group convolution on x86
|
7 years ago |
nihui
|
5d04a3a45c
|
layer holds bottom blob scale, depthwise convolution read group scales
|
7 years ago |
nihui
|
354b95256c
|
bump param version, backward compatible
|
7 years ago |
nihuini
|
9843b9e158
|
Merge branch 'master' of https://github.com/Tencent/ncnn
|
7 years ago |
nihuini
|
2bc504925e
|
fix int8_scales from multiple blobs, fix #512
|
7 years ago |
nihui
|
af806a2d8d
|
Update README.md
|
7 years ago |
nihuini
|
da352916fe
|
fix pd using flag condition
|
7 years ago |
nihuini
|
6b536701c3
|
sub-mat shall be allocator-aware
|
7 years ago |
nihuini
|
e34aa7786a
|
armv7 int8 quantize/dequantize and conv1x1s1
|
7 years ago |
nihuini
|
55358f61b6
|
light mode is the default, add mobilenetv2ssdlite example
|
7 years ago |
nihui
|
dbf1c405d4
|
Create CONTRIBUTING.md
|
7 years ago |
nihuini
|
4be27a0a89
|
int8 inference on x86
|
7 years ago |
nihui
|
6eb6abfd4a
|
autotest never worked, delete it ;)
|
7 years ago |
nihui
|
a169cec363
|
core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table
|
7 years ago |
nihui
|
b6b90c888f
|
high resolution timestamp on windows
|
7 years ago |
nihui
|
af49e2cada
|
install allocator.h
|
7 years ago |
nihui
|
ae467fee25
|
project-wide NOMINMAX on msvc
|
7 years ago |
nihui
|
7e1f358084
|
fix build on msvc
|
7 years ago |
nihui
|
9706cd1447
|
implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469
|
7 years ago |
nihui
|
5879cb4d15
|
sgemm outperform direct conv on large channel
|
7 years ago |
nihui
|
20c0794b36
|
Update README.md
|
7 years ago |
nihuini
|
4b8101e7fc
|
Revert "optimize interleave section for load first, about 5%~10% speed gain"
This reverts commit 1e4eaeeacd.
|
7 years ago |
nihui
|
56a667472a
|
sgemm is always faster on common channel size
|
7 years ago |
nihui
|
1e4eaeeacd
|
optimize interleave section for load first, about 5%~10% speed gain
|
7 years ago |
Qu Xiaofeng / 曲晓峰
|
d0cad77a15
|
Fixed two typos (#466)
|
8 years ago |
nihui
|
6895cbf810
|
single vldm is faster than two vld1 on armv7, and some pipeline optimize
|
8 years ago |
nihuini
|
05d7562a5d
|
reorder kernel weight, pipeline friendly ;)
|
8 years ago |
nihuini
|
0bbdbf4ff8
|
add mobilenet-yolo
|
8 years ago |
nihuini
|
543d764674
|
fix yolo preprocess, comment about mobilenet-yolo
|
8 years ago |
nihui
|
5c6ef31e07
|
-x
|
8 years ago |
nihui
|
eb089c0b32
|
add yolov2 example
|
8 years ago |
nihui
|
a94e5adfd1
|
fix debug build
|
8 years ago |
nihui
|
0b6791e2ba
|
convert BN ReLU6 Reorg YoloDetectionOutput Embed LSTM
|
8 years ago |
nihui
|
b8f4f024a4
|
implement reorg yolodetectionoutput layer from caffe-yolov2
|
8 years ago |
kalcohol
|
8491f2b6a3
|
fix error C2059 and C2589 when using std::min and std::max. (#456)
|
8 years ago |
BUG1989
|
b3965e26cb
|
Update README.md (#452)
|
8 years ago |
nihuini
|
ee98817446
|
proper first row/col handling in resize family, fix #429
|
8 years ago |
nihuini
|
511baa6718
|
optional image pixel api, fix #434
|
8 years ago |
nihui
|
74d1c1470f
|
update qcom810 iphone5s benchmark result
|
8 years ago |