Martin Kroeker
25f2d25cfe
Merge pull request #1697 from martin-frbg/issue1696
Do not treat Windows UWP builds as cross-compiling
7 years ago
Martin Kroeker
73131fa30a
Do not treat Windows UWP builds as cross-compiling
7 years ago
Martin Kroeker
b74aef2816
Add -march=skylake-avx512 to AVX512 compile check and suppress its output
7 years ago
Martin Kroeker
26e1cfb653
Merge pull request #1607 from martin-frbg/dynarch
Move some x86_64 DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
02634b549b
Add template for OpenBLASConfig.cmake
7 years ago
Martin Kroeker
1cbd8f3ae4
Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
cf234a0561
Merge pull request #1589 from fenrus75/skylakex
Initial support for SkylakeX / AVX512
7 years ago
Martin Kroeker
e4718b1fee
Better AVX512 test case
7 years ago
Martin Kroeker
7fb62aed7e
Check build system support for AVX512 instructions
7 years ago
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transformation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
7 years ago
Martin Kroeker
6791294312
Merge pull request #1559 from martin-frbg/buildconf
Add build-time configuration options to pkgconfig file
7 years ago
Martin Kroeker
7d7564568c
Add build-time configuration options to pkgconfig file
7 years ago
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
7 years ago
Sacha
f81815e48a
Fix CMake cross-compiling
Without specifying thread count, NUM_THREADS would not be defined and CMake would fail.
This is because core count cannot be determined when cross-compiling.
8 years ago
xoviat
038bfbb86c
CMake: Remove unused wall option when FC=flang
8 years ago
Martin Kroeker
599de9e598
Restore LAPACKE files for Xgeqpf, Xggsvd and Xggsvp
These were inadvertently dropped from the list in my PR #1095
8 years ago
Martin Kroeker
0dc291d3fa
Merge pull request #1377 from isuruf/threads
Allow overriding NUM_THREADS in cmake
8 years ago
Isuru Fernando
e0ddd7d124
Allow overriding NUM_THREADS
8 years ago
martin
5056a044b2
fix location of lapacke_nancheck
8 years ago
martin
4054d32def
update cmake files
8 years ago
martin
2d52f0f4c3
update cmakefiles for lapack 3.8.0
8 years ago
Ian Henriksen
505dc08635
Update lapacke.cmake with routines added in LAPACK 3.7.0.
8 years ago
Ian Henriksen
61587b0670
Update lapack.cmake with additional routines from LAPACK version 3.7.0.
8 years ago
Ian Henriksen
632fc75d77
Allow using compilers other than gfortran in conjunction with
MSVC or clang-cl.
8 years ago
Martin Kroeker
962b20a9bb
Optionally add ReLAPACK to LIB_COMPONENTS
8 years ago
Martin Kroeker
c7a8512d12
CMake fixes for DYNAMIC_ARCH builds and whitespace in path names (#1323)
* prebuild.cmake: Put quotes around path names that may contain whitespace
(Copied from alexkaratakis' PR #1295 )
* kernel/CMakeLists.txt: Fix common_lapack header inclusion and DYNAMIC_ARCH generation of ?neg_tcopy and ?laswp_ncopy files
* lapack/CMakeLists.txt: Use correct template for ?laswp_(plus,minus) functions
8 years ago
Sacha
7a867082d8
Fix open_blas.config which was never working out-of-source. Remove need for gen_config_h.exe. If OpenMP is requested, do not silently ignore when it isn't available.
8 years ago
Sacha Refshauge
47ebce4d1a
Clean up, fix old typos. Simplify arch usages. Move system arch check to earlier position.
8 years ago
Sacha Refshauge
69b560751c
Improvements to previous commit (cross-compile).
Fix typos and bad if statements discovered in 0.2.20.
8 years ago
Sacha Refshauge
0a7a527a92
Add support for cross compiling.
Add support for not having host compiler as CMake cannot detect such a compiler.
Add support for not using getarch.
Successfully builds Android ARMV8. Any target can be added by supplying the TARGET_CORE config in prebuild.cmake.
8 years ago
Sacha Refshauge
6aac06587d
Fix typos and use CMake OpenMP support.
8 years ago
7c1acc07f0
Fix bug that required fortran. Fix bug that needed CXX var. Remove redundant set vars. Fix threading detection. Do not attempt to run code if cross compiling.
8 years ago
38d273ea03
Drop some redundant vars and improve arch detection in CMake.
8 years ago
90a4dab501
Let CMake deal with build type.
8 years ago
Isuru Fernando
d3b677fe87
Add commonobjs
8 years ago
Isuru Fernando
505b218829
Merge remote-tracking branch 'upstream/develop' into dyn
8 years ago
Isuru Fernando
d9346930dd
Merge remote-tracking branch 'upstream/develop' into develop
8 years ago
Isuru Fernando
1d1854032b
Add missing EXCAVATOR
8 years ago
Isuru Fernando
d798487213
Fixes for dynamic_arch. almost there
8 years ago
Isuru Fernando
4260215adf
Support DYNAMIC_ARCH with cmake
8 years ago
Isuru Fernando
89ec2be110
Fix lapacke copying
8 years ago
Isuru Fernando
d245caa49a
Support out-of-source build
8 years ago
Isuru Fernando
7a96499b29
Don't change timestamps
8 years ago
Isuru Fernando
5e0f67c666
Make ARCH variable a CACHE variable
8 years ago
Isuru Fernando
ea1095135e
Ninja complains that file openblas.def does not exist
8 years ago
Isuru Fernando
4c5df489db
clang on windows needs FU=''
8 years ago
Isuru Fernando
dc24914415
check compiler is msvc instead of msvc
8 years ago
Martin Kroeker
ad2462811a
Do not add -lpthread on Android builds ( #1229 )
* Do not add -lpthread on Android builds
* Do not add -lpthread on Android cmake builds
8 years ago
Neil Shipp
34513be726
Add Microsoft Windows 10 UWP build support
8 years ago
Neil Shipp
65e56cb29d
Add 64bit support for Microsoft Visual Studio
8 years ago