Martin Kroeker
80373ea039
More fixes for silly misedits
8 years ago
Martin Kroeker
d12b75a6c4
Fixup braces lost in previous edit
8 years ago
Martin Kroeker
7294fb1d9d
Merge branch 'develop' into cgroups
8 years ago
Zhang Xianyi
2a7c6930ac
Merge pull request #1234 from brada4/develop
Fix write past fixed size buffer
8 years ago
Andrew
529bfc36ec
Fix write past fixed size buffer
8 years ago
Martin Kroeker
731c518cff
Add files via upload
8 years ago
Martin Kroeker
29fc429d9a
Honor cgroup/cpuset constraints when enumerating cpus
8 years ago
Martin Kroeker
3db2adf872
Merge pull request #1230 from martin-frbg/rhel5
Add sched_getcpu implementation for pre-2.6 glibc
8 years ago
Martin Kroeker
c1cf62d2c0
Add sched_getcpu implementation for pre-2.6 glibc
Fixes #1210 , compilation on RHEL5 with affinity enabled
8 years ago
Zhang Xianyi
bfe1656b8b
Merge pull request #1225 from martin-frbg/stolen_from_wernsaar_fork
fixed syrk_thread.c taken from wernsaar
8 years ago
Martin Kroeker
49e62c0e77
fixed syrk_thread.c taken from wernsaar
Stride calculation fix copied from https://github.com/wernsaar/OpenBLAS/commit/88900e1
8 years ago
Neil Shipp
34513be726
Add Microsoft Windows 10 UWP build support
8 years ago
Neil Shipp
65e56cb29d
Add 64bit support for Microsoft Visual Studio
8 years ago
James Cowgill
59c97cfee4
memory: Fix buffer overflow when position == NUM_BUFFERS
8 years ago
James Cowgill
5fecfe0f42
memory: switch loop condition around in blas_memory_free
Before this commit, the "position < NUM_BUFFERS" loop condition from
blas_memory_free will be completely optimized away by GCC. This is
because the condition can only be false after undefined behavior has
already been invoked (reading past the end of an array). As a
consequence of this bug, GCC also removes the subsequent if statement
and all the code after the error label because all of it is dead.
This commit switches the loop condition around so it works as intended.
8 years ago
Gian-Carlo Pascutto
9c884986ad
Add an extra familiy/model combination used by AMD Steamrolller (Godavari).
8 years ago
Gian-Carlo Pascutto
0cbd2d34e4
Recognize ZEN when passed as OPENBLAS_CORETYPE.
8 years ago
Gian-Carlo Pascutto
62979fd104
Fix dynamic detection for ZEN CPUs.
8 years ago
Denis Steckelmacher
c9ff735da6
Add ZEN support (tested for auto-detected static backend)
9 years ago
Martin Kroeker
ffc1d6c468
Merge pull request #1108 from ashwinyes/develop_20170203_thunderx2t99
Optimized Implementations for ThunderX2T99
9 years ago
Ashwin Sekhar T K
a86474c6f7
THUNDERX2T99: Performance fix for ZGEMM
9 years ago
Ashwin Sekhar T K
19ba133383
THUNDERX2T99: Add Optimized ZGEMM Implementation
9 years ago
Andrew
5088523786
detect apollo lake for real
9 years ago
Elliot Saba
1d8ab99e09
Add `exfamily == 9` case (Kaby Lake) to dynamic arch detection
9 years ago
Martin Koehler
76c6e33e54
Enable EXCAVATOR kernels for A12-9800
9 years ago
Ashwin Sekhar T K
2757b49767
THUNDERX2T99: Add Optimized CGEMM Implementation
9 years ago
Ashwin Sekhar T K
f279ff4789
THUNDERX2T99: Add Optimized SGEMM Implementation
9 years ago
Zhang Xianyi
0863a0d4b4
Merge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch
Add new targets for ARM64
9 years ago
Werner Saar
c1c5a63d3c
prepared parameter.c for UNROLL values, that are not a power of two
9 years ago
Ashwin Sekhar T K
4b55fae337
ARM64: Add Cavium THUNDERX2T99 Target
9 years ago
Ashwin Sekhar T K
0b8e876d89
VULCAN: Add optimized DGEMM implementation
9 years ago
Ashwin Sekhar T K
4713e7c47f
ARM64: Add the VULCAN Target
9 years ago
jiahaipeng
1aa1e6cb54
modify the blas_l1_thread.c for support multi-threded for L1 fuction with return value
9 years ago
Werner Saar
b9bb009236
Merge pull request #1053 from wernsaar/develop
prepared driver/level3 functions for UNROLL values, that are not a po…
9 years ago
Werner Saar
a2672d5589
prepared driver/level3 functions for UNROLL values, that are not a power of two
9 years ago
Martin Kroeker
51aa157e64
Relocate declaration of alloc_lock outside ifdef block
9 years ago
Martin Kroeker
87c7d10b34
Fix thread data races detected by helgrind 3.12
Ref. #995 , may possibly help solve issues seen in 660,883
9 years ago
Martin Kroeker
0ef7841473
Update xerbla.c
9 years ago
Martin Kroeker
104ad066af
Use appropriate int32/int64 format for error number in message string
9 years ago
Alex Arslan
a16ace68f5
Include system headers on FreeBSD
9 years ago
Martin Kroeker
596ead0f8d
Add files via upload
9 years ago
Zhang Xianyi
66c9a9b33d
Merge pull request #981 from howard0su/develop
USE NPROCESSOR_CONF instaed of NPORCESSOR_ONLN
9 years ago
Martin Kroeker
8a8f3932eb
Update dynamic.c
Add Bay Trail "Pentium N3520" atom
9 years ago
Howard Su
ff1da01476
USE NPROCESSOR_CONF instaed of NPORCESSOR_ONLN
to determine the number of CPU. In ARM platform,
online CPU will increasing when there is more workload.
while configure cpu is the max number of CPU.
9 years ago
Zhang Xianyi
ef52a9266b
Fixed #979 . Patch for NetBSD.
9 years ago
Martin Kroeker
7de829f713
Update dynamic.c
Add Braswell (extended model 4, model 12) N3150 as Nehalem
9 years ago
John Biddiscombe
053044ae4d
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
9 years ago
Ashwin Sekhar T K
0fb380c966
Update NUMA CPU binding
When the number of process can all be
accommodated within the current node,
then use cores from the current node only.
9 years ago
Werner Saar
78b05f6476
bugfix for EXCAVATOR and DYNAMIC_ARCH
9 years ago
Werner Saar
2b967590a0
bugfix in dynamic.c
9 years ago