685 Commits (a3a2548aa28a9ff7924f76d5085fabe94de79ddb)
 

Author SHA1 Message Date
  Diego Gomes 837e6b047e Rasp bench (#531) 7 years ago
  BUG1989 1b0e33460d add armv7 int8 conv3x3s1,using vaddw to replace vadd and vmovl 7 years ago
  nihui 72411b7a6c restore the old conv3x3s2 as reference, fast dilation convolution fails on striding 7 years ago
  nihui 1f20eb4e8c pack weight and more unroll makes improvement, ~20% faster for conv3x3s2 7 years ago
  chensy 30cc738309 fix asm "invalid operand" error for target iOS armv7 on file dequantize_arm.cpp 7 years ago
  Diego Gomes 4d73407df8 fix gettid call for glibc 7 years ago
  Diego Gomes 534f38ed87 fix auxv read for elf64 7 years ago
  nihuini 2dbaf6f7b7 store int8 scale in binary 7 years ago
  nihui cebded134a enable pool allocator in sample project, display unscaled image 7 years ago
  nihui fe14037777 more sub op preload 7 years ago
  nihui 2fe7ada4d8 add arm int8 convolution stub, preload group op for x86 7 years ago
  nihui eac7c66a97 fix fp32 group convolution on x86 7 years ago
  nihui 5d04a3a45c layer holds bottom blob scale, depthwise convolution read group scales 7 years ago
  nihui 354b95256c bump param version, backward compatible 7 years ago
  nihuini 9843b9e158 Merge branch 'master' of https://github.com/Tencent/ncnn 7 years ago
  nihuini 2bc504925e fix int8_scales from multiple blobs, fix #512 7 years ago
  nihui af806a2d8d Update README.md 7 years ago
  nihuini da352916fe fix pd using flag condition 7 years ago
  nihuini 6b536701c3 sub-mat shall be allocator-aware 7 years ago
  nihuini e34aa7786a armv7 int8 quantize/dequantize and conv1x1s1 7 years ago
  nihuini 55358f61b6 light mode is the default, add mobilenetv2ssdlite example 7 years ago
  nihui dbf1c405d4 Create CONTRIBUTING.md 7 years ago
  nihuini 4be27a0a89 int8 inference on x86 7 years ago
  nihui 6eb6abfd4a autotest never worked, delete it ;) 7 years ago
  nihui a169cec363 core int8 inference, quantize and dequantize, net using flag, caffe2ncnn reads int8 scale table 7 years ago
  nihui b6b90c888f high resolution timestamp on windows 7 years ago
  nihui af49e2cada install allocator.h 7 years ago
  nihui ae467fee25 project-wide NOMINMAX on msvc 7 years ago
  nihui 7e1f358084 fix build on msvc 7 years ago
  nihui 9706cd1447 implement ncnn blob/workspace allocator, fine-grained per-layer openmp threads control, fix #469 7 years ago
  nihui 5879cb4d15 sgemm outperform direct conv on large channel 7 years ago
  nihui 20c0794b36 Update README.md 7 years ago
  nihuini 4b8101e7fc Revert "optimize interleave section for load first, about 5%~10% speed gain" 7 years ago
  nihui 56a667472a sgemm is always faster on common channel size 7 years ago
  nihui 1e4eaeeacd optimize interleave section for load first, about 5%~10% speed gain 7 years ago
  Qu Xiaofeng / 曲晓峰 d0cad77a15 Fixed two typos (#466) 8 years ago
  nihui 6895cbf810 single vldm is faster than two vld1 on armv7, and some pipeline optimize 8 years ago
  nihuini 05d7562a5d reorder kernel weight, pipeline friendly ;) 8 years ago
  nihuini 0bbdbf4ff8 add mobilenet-yolo 8 years ago
  nihuini 543d764674 fix yolo preprocess, comment about mobilenet-yolo 8 years ago
  nihui 5c6ef31e07 -x 8 years ago
  nihui eb089c0b32 add yolov2 example 8 years ago
  nihui a94e5adfd1 fix debug build 8 years ago
  nihui 0b6791e2ba convert BN ReLU6 Reorg YoloDetectionOutput Embed LSTM 8 years ago
  nihui b8f4f024a4 implement reorg yolodetectionoutput layer from caffe-yolov2 8 years ago
  kalcohol 8491f2b6a3 fix error C2059 and C2589 when using std::min and std::max. (#456) 8 years ago
  BUG1989 b3965e26cb Update README.md (#452) 8 years ago
  nihuini ee98817446 proper first row/col handling in resize family, fix #429 8 years ago
  nihuini 511baa6718 optional image pixel api, fix #434 8 years ago
  nihui 74d1c1470f update qcom810 iphone5s benchmark result 8 years ago