Martin Kroeker
58dd7e4501
Change ARMV8 target to ARMV7 for BINARY=32
7 years ago
Martin Kroeker
802f0dbde1
More fixes for cross-compiling ARM64 targets
Fixed core naming for DYNAMIC_ARCH. Corrected GEMM_DEFAULT entries and added SYMV_P. Replaced outdated VULCAN define for ThunderX2T99 with ARMV8 to get basic definitions back. For issue #1908
7 years ago
Martin Kroeker
20d1aad13f
Fix missing quotes around thunderx targets
7 years ago
Martin Kroeker
e1eab96502
Merge pull request #1931 from martin-frbg/pr1921
Add -mavx2 to TARGET=HASWELL builds
7 years ago
Martin Kroeker
76b4b8980f
Use -dumpversion with gcc only
7 years ago
Martin Kroeker
49e0f485da
Add -mavx2 for TARGET=HASWELL if compiler supports and requires it
7 years ago
Martin Kroeker
26a3402773
Reflect ARMV8 target definition changes from PR1876
and create config target directory for cross-compiles.
7 years ago
Martin Kroeker
133c278ee5
Add DYNAMIC_CORE list for ARM64
cf #1908
7 years ago
Martin Kroeker
dceff5542c
Handle Android environments that identify as Linux ( #1898 )
* Handle Android environments that identify as Linux
termux terminal emulator does this, causing build failures through missed defines in common.h
7 years ago
Martin Kroeker
081ceb3e02
Propagate version number for openblas_get_config
7 years ago
Andrew
40cce0e353
handle cmake too
7 years ago
Martin Kroeker
2263d3906c
Merge pull request #1812 from martin-frbg/issue1806-2
Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake…
7 years ago
Martin Kroeker
81c9985c3a
Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake-avx512
7 years ago
Martin Kroeker
56ebc7b53e
Merge pull request #1808 from martin-frbg/issue1806
Add -march=skylake-avx512 to CFLAGS when the target is Skylake
7 years ago
Martin Kroeker
8a11ec19d1
Syntax fix
7 years ago
Martin Kroeker
fa53b903db
Add -march=skylake-avx512 to CFLAGS when the target is Skylake
Should fix 1806 and #1801
7 years ago
Martin Kroeker
84bcdf9c66
Revert "Add -march=skylake-avx512 when required"
7 years ago
Martin Kroeker
a9b51b8448
Merge pull request #1798 from martin-frbg/cmake-avx512
Add -march=skylake-avx512 when required
7 years ago
Martin Kroeker
eba394c711
Add -march=skylake-avx512 when required
fixes #1797
7 years ago
Martin Kroeker
02ef20a1e4
Merge pull request #1786 from martin-frbg/immintrin
Check for Immintrin.h presence in the AVX512 compatibility test as well
7 years ago
Martin Kroeker
4c3643ed7f
Check availability of immintrin.h in the AVX512 compatibility test
7 years ago
Yuri
2349e15149
Allow to install the 'interfare64' version concurrently with the regular version
7 years ago
Martin Kroeker
b1b743f434
Merge branch 'develop' into interim033
7 years ago
Martin Kroeker
2a589c4b28
Add USE_TLS option to switch between old and new memory.c
7 years ago
Martin Kroeker
25f2d25cfe
Merge pull request #1697 from martin-frbg/issue1696
Do not treat WIndows UWB builds as cross-compiling
7 years ago
Martin Kroeker
73131fa30a
Do not treat WIndows UWB builds as cross-compiling
7 years ago
Martin Kroeker
b74aef2816
Add -march=skylake-avx512 to AVX512 compile check and suppress its output
7 years ago
Martin Kroeker
26e1cfb653
Merge pull request #1607 from martin-frbg/dynarch
Move some x86_64 DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
02634b549b
Add template for OpenBLASConfig.cmake
7 years ago
Martin Kroeker
1cbd8f3ae4
Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
cf234a0561
Merge pull request #1589 from fenrus75/skylakex
Initial support for SkylakeX / AVX512
7 years ago
Martin Kroeker
e4718b1fee
Better AVX512 test case
7 years ago
Martin Kroeker
7fb62aed7e
Check build system support for AVX512 instructions
7 years ago
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
7 years ago
Martin Kroeker
6791294312
Merge pull request #1559 from martin-frbg/buildconf
Add build-time configuration options to pkgconfig file
7 years ago
Martin Kroeker
7d7564568c
Add build-time configuration options to pkgconfig file
7 years ago
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
7 years ago
Sacha
f81815e48a
Fix CMake cross-compiling
Without specifying thread count, NUM_THREADS would not be defined and CMake would fail.
This is because core count cannot be determined when cross-compiling.
8 years ago
xoviat
038bfbb86c
CMake: Remove unused wall option when FC=flang
8 years ago
Martin Kroeker
599de9e598
Restore LAPACKE files for Xgeqpf, Xggsvd and Xggsvp
These were inadvertently dropped from the list in my PR #1095
8 years ago
Martin Kroeker
0dc291d3fa
Merge pull request #1377 from isuruf/threads
Allow overriding NUM_THREADS in cmake
8 years ago
Isuru Fernando
e0ddd7d124
Allow overriding NUM_THREADS
8 years ago
martin
5056a044b2
fix location of lapacke_nancheck
8 years ago
martin
4054d32def
update cmake files
8 years ago
martin
2d52f0f4c3
update cmakefiles for lapack 3.8.0
8 years ago
Ian Henriksen
505dc08635
Update lapacke.cmake with routines added in LAPACK 3.7.0.
8 years ago
Ian Henriksen
61587b0670
Update lapack.cmake with additional routines from LAPACK version 3.7.0.
8 years ago
Ian Henriksen
632fc75d77
Allow using compilers other than gfortran in conjunction with
MSVC or clang-cl.
8 years ago
Martin Kroeker
962b20a9bb
Optionally add ReLAPACK to LIB_COMPONENTS
8 years ago
Martin Kroeker
c7a8512d12
Cmake fixes for DYNAMIC_ARCH builds and whitespace in path names ( #1323 )
* prebuild.cmake: Put quotes around path names that may contain whitespace
(Copied from alexkaratakis' PR #1295 )
* kernel/CMakeLists.txt: Fix common_lapack header inclusion and DYNAMIC_ARCH generation of ?neg_tcopy and ?laswp_ncopy files
* lapack/CMakeLists.txt: Use correct template for ?laswp_(plus,minus) functions
8 years ago