Martin Kroeker
abea977ded
Merge pull request #2162 from martin-frbg/pgi
Fixes for PGI compiler
6 years ago
Martin Kroeker
7684c4f8f8
PGI compiler does not like -march=native
6 years ago
Martin Kroeker
4ea794a522
Avoid unintentional activation of TLS code via USE_TLS=0
fixes #2149
6 years ago
Martin Kroeker
0ab4076dd8
Merge pull request #2156 from martin-frbg/issue2154
Add gfortran workaround for C->FORTRAN ABI violation
6 years ago
Martin Kroeker
900d5a3205
Add gfortran workaround for ABI violations in LAPACKE
for #2154 (see gcc bug 90329)
6 years ago
Michael Lass
7a9a4dbc4f
Fix detection of AVX512 capable compilers in getarch
21eda8b5 introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
6 years ago
Martin Kroeker
f66c11fc22
Remove unrelated change
6 years ago
Martin Kroeker
d2cb610272
Add option USE_LOCKING for single-threaded build with locking support
for calling from concurrent threads
6 years ago
Rashmica Gupta
bcdf1d4917
Add in runtime CPU detection for POWER.
6 years ago
Martin Kroeker
c4868d11c0
Make sure that AVX512 is disabled in 32bit builds
for #2033
7 years ago
Martin Kroeker
d66214c946
Make x86_32 imply NO_AVX2, NO_AVX512 in addition to NO_AVX
fixes #2033
7 years ago
Martin Kroeker
36b844af88
Change ARMV8 target to ARMV7 when BINARY32 is set
fixes #1961
7 years ago
TiborGY
d11554c88f
Validate user supplied TARGET ( #1941 )
the build will now abort with an error message when an undefined build TARGET is named
Fixes #1938
7 years ago
Martin Kroeker
5bd21ab6e1
Make sure that -fPIC is present when needed
override user-provided FFLAGS if necessary
7 years ago
Martin Kroeker
c5f8aeff2d
Merge branch 'develop' into fbsd12
7 years ago
Martin Kroeker
ea6d1b96bd
Update Makefile.system
7 years ago
Martin Kroeker
3c9e3faedb
fixup BSD naming of powerpc arch
7 years ago
Andrew
44c81fd135
oops
7 years ago
Andrew
26b3710485
Add architecture mappings for FreeBSD12
7 years ago
pkubaj
f85ce54d4a
Use correct Makefile on powerpc64
FreeBSD uses powerpc64 name for POWER architecture. Use correct Makefile for this platform.
7 years ago
Martin Kroeker
a29ec458c2
propagate verison number for openblas_config_version
7 years ago
Martin Kroeker
b0c15bacc1
Merge pull request #1863 from martin-frbg/aix_install3
Set LIBSONAME suffix to .a for AIX
7 years ago
Martin Kroeker
cfb0f5b0f8
Set LIBSONAME suffix to .a for AIX
another fix for #1803
7 years ago
Martin Kroeker
5f8f0583d4
Merge branch 'develop' into fc-1847
7 years ago
Andrew
7d3502b500
Add -frecursive gfortran option by default
7 years ago
Ashwin Sekhar T K
d5aeff636f
ARM64: Enable DYNAMIC_ARCH
Enable DYNAMIC_ARCH feature on ARM64. This patch uses the cpuid
feature in linux kernel to detect the core type at runtime
(https://www.kernel.org/doc/Documentation/arm64/cpu-feature-registers.txt ).
If this feature is missing in kernel, then the user should use the
OPENBLAS_CORETYPE env variable to select the desired core type.
7 years ago
Martin Kroeker
c0d7cd3dac
Merge pull request #1799 from martin-frbg/issue1796
Handle conflicting usage of ARCH in at least some BSD environments
7 years ago
Martin Kroeker
d3d58f8ee5
Catch conflicting usage of ARCH in at least some BSD environments
fixes #1796
7 years ago
Martin Kroeker
2caa2210bb
Add USE_TLS option to choose between old and new implementation of memory.c
7 years ago
Martin Kroeker
0b2b83d9ed
Add support for a user-defined list of dynamic targets
7 years ago
Martin Kroeker
6c2d90ba77
Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
f6021c798d
Re-enable QUIET_MAKE
7 years ago
Martin Kroeker
e8002536ec
disable quiet_make for the moment
7 years ago
Martin Kroeker
15a78d6b66
export NO_AVX512 setting
7 years ago
Martin Kroeker
b7feded85a
Propagate NO_AVX512 via CCOMMON_OPT
7 years ago
Martin Kroeker
a7d0f49cec
Add SKYLAKEX to DYNAMIC_CORE list only if AVX512 is available
7 years ago
Martin Kroeker
f1fb9a4745
Propagate NO_AVX512 if needed
7 years ago
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
7 years ago
Zhang Xianyi
50acc40613
Merge pull request #1536 from WestAlgo/develop
Fix race condition in blas_server_omp.c
7 years ago
Martin Kroeker
12398e53ce
Merge pull request #1553 from martin-frbg/ifort-openmpflag
Change -openmp to -fopenmp for ifort entry as well
7 years ago
Martin Kroeker
193f835662
Change -openmp to -fopenmp for ifort entry as well
7 years ago
Martin Kroeker
e3a069f108
Merge pull request #1550 from martin-frbg/ifort-openmpflag
Update compiler flag for openmp use with ICC
7 years ago
Martin Kroeker
65b8a5c5d8
Update compiler flag for openmp use with ICC
The deprecated -openmp option was finally removed in favor of -qopenmp or -fopenmp, picking the latter to stay compatible with Intel compiler versions before 2015 (when -q options were introduced). Fixes #1546
7 years ago
Martin Kroeker
71051259e0
Restore compiler options for mips P5600 target
7 years ago
Martin Kroeker
d94d7baf7e
Add mips32r2 api target
7 years ago
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
7 years ago
Alex Arslan
8f811a9312
Reinstate macOS logic
7 years ago
Alex Arslan
36a17536ca
Compile with cc rather than gcc whenever possible
7 years ago
Alex Arslan
a41d241a0e
Add support for DragonFly BSD
7 years ago
Alex Arslan
8da6b6ae52
Allow building on OpenBSD
With this change, OpenBLAS builds and all tests pass on OpenBSD 6.2
using Clang. Tested on x86-64 only, with and without DYNAMIC_ARCH=1.
7 years ago