nihuini
11794675f3
apple a11 and a12 do not support armv8.2 dotprod, restore the fp16-only optimized path
4 years ago
Tijmen Verhulsdonck
eaa7e24db6
Added ability to switch AVX/AVX2 during runtime ( #3076 )
4 years ago
nihui
1c31ac2549
runtime cpu dispatch for mips msa and loongson mmi
5 years ago
nihuini
76b478f59b
the zfh detector hacking
5 years ago
nihuini
afc02d57f9
runtime detect armv8.2 dotprod
5 years ago
nihui
5d6f03dbbd
asm word for risc-v without v
5 years ago
nihui
a1b06baec8
fix build
5 years ago
nihui
17936e9f54
fix packing risc-v test, add cpu_riscv_vlenb()
5 years ago
nihui
11958424c2
runtime riscv v and zfh dispatch, riscv v optimization for cast
5 years ago
nihui
45bf3cd779
add runtime riscv v detection function, the initial c906 riscv linux toolchain
5 years ago
nihuini
e86799e95f
fix get_big_cpu_count return zero on smp cpu
5 years ago
Youngsoo Lee
b9bed8d993
feat: add denormal options ( #2656 )
* feat: add denormal options
Flush-To-Zero (FTZ) and Denormals-Are-Zero (DAZ) are modes that bypass the IEEE 754 handling of denormal floating-point numbers on x86_64 and some x86 CPUs.
* feat: Integrate `flush_denormals` into `Extractor::extract`
* chore: replace global variable with `ThreadLocalStorage`
5 years ago
sunnycase
124d2c3d85
Support V831 ( #2478 )
5 years ago
Evgeny Proydakov
d8478c11bc
Fixed compile warning in src/cpu.cpp for ios 32/64 bit compilation. [-Wunused-variable] ( #2452 )
5 years ago
nihuini
1a3191e245
fix libncnn build with gcc-4.8 and gcc-4.4, fix #2388
5 years ago
nihui
1184404fbf
support Apple M1 Silicon ( #2335 )
* recognize apple M1
* macos ios cpuset and thread affinity
* big little cpu topology for macos and ios
* silence affinity not supported error
5 years ago
Zhuo Zhang
f05999e792
get_little_cpu_count() and get_big_cpu_count() ( #2247 )
5 years ago
Zhuo Zhang
6475d782c1
try to support Open Harmony OS ( #2241 )
5 years ago
nihui
e4a4b51d27
openmp on webassembly ( #2234 )
* openmp on webasm works
* fix compile flags
* dynamic kmp runtime initialization
* clang simpleomp ci
* fix dispatch on unique cpu
5 years ago
ncnnnnn
a135900094
fix fscanf -Wunused-result ( #2205 )
5 years ago
tpoisonooo
2c8288555c
fix(cpu): cpu number bigger than 64 ( #2083 )
Co-authored-by: nihui <shuizhuyuanluo@126.com>
5 years ago
Zhuo Zhang
418047661c
fix #1984 & fix cmake ( #2000 )
5 years ago
nihuini
4e4f0baa73
set openmp blocktime 20 for reducing power consumption, blocktime option
5 years ago
nihui
bb5bfe3841
avx2 infrastructure ( #1943 )
5 years ago
nihui
11cffce114
armv8.2 infrastructure ( #1856 )
* runtime cpu dispatch
* force thread one
* disable openmp for coverage
* simplify test layer
* print NCNN_TARGET_ARCH
* less ci build variants
* weight fp16 storage option
* test convdw int8
* apple a12 a13
* ncnn_add_layer ncnn_add_shader cmake macro
5 years ago
nihui
3ef995ed1e
format code style and setup restyled.io ( #1840 )
6 years ago
nihuini
16aa0bff7b
fix cpu affinity mask unsigned overflow
6 years ago
tpoisonooo
8e1c3ac4d1
Add crop para check ( #1825 )
* add copy_cut_border check; fix compile warnings
6 years ago
SunTY
705dd36a31
simplestl is an alternative std vector string implementation ( #1762 )
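* remove the dependency on stl
* fix header file name and push_back
* remove managed construction
* this seems to be just fiddling
* data now returns a pointer, not a reference to a pointer
* fix a mistake in resize
* stdint
* add c_str
* rename file to lowercase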
* NCNN_SIMPLESTL option
* simplestl default to OFF
* Update linux-x64-cpu-gcc.yml
* Update linux-x64-cpu-gcc.yml
* Update linux-x64-cpu-clang.yml
* drop functional header
* arm32 arm64 simplestl ci
* fix a memory leak, remove compiler warnings
* fix the default value bug in resize
Co-authored-by: nihuini <nihuini@tencent.com>
Co-authored-by: nihui <shuizhuyuanluo@126.com>
6 years ago
Naiyang Lin
ceef2470a5
Add logger.h ( #1753 )
6 years ago
nihuini
ee41ef4a37
include <limits.h> for INT_MAX, fix #1631
6 years ago
nihuini
c009928628
sizeof returns bytes, not bits
6 years ago
nihuini
4c6bf24205
explicit cpu thread affinity
6 years ago
nihui
c819b4d839
fix build without openmp
6 years ago
nihui
2905639890
fix null fp handling
6 years ago
BUG1989
69e2693c87
fix the bug of SMP cpu powersave not supported.
6 years ago
Corentin Barreau
736ed5e835
Support for Raspberry Pi 3 compilation ( #640 )
* Add: pi3.toolchain.cmake - CMake variables for Pi3 compilation.
* Modify: CMakeLists.txt - Add definitions for Pi3 compilation.
* Modify: src/cpu.cpp - Use getpid() if -PI3 is used.
7 years ago
nihuini
f78e8fd8e4
use max threads as cpu count on openmp capable runtime
7 years ago
Diego Gomes
4d73407df8
fix gettid call for glibc
7 years ago
Diego Gomes
534f38ed87
fix auxv read for elf64
7 years ago
nihuini
90643630c2
apple a10/a11 is armv8.2-a
8 years ago
nihui
2d4ae30508
fallback to all cores
8 years ago
nihui
bc99d5123b
set smp cpu affinity to all cores
8 years ago
nihuini
a36af39040
improve cpu max frequency detection, fix #146
8 years ago
HaoChen
5fda804f11
remove redundant “+” from line 390
8 years ago
Zuyu Zhang
d5f0214ec7
Fixed the build warnings and added gitignore.
8 years ago
nihuini
b7db8be4f6
add ncnn source qwq
9 years ago