Martin Kroeker
cf6df9464c
Document the stub status of the QUAD_PRECiSION code ( #1772 )
* Document the stub status of the QUAD_PRECiSION code inherited from GotoBLAS2
in response to #1769
7 years ago
Martin Kroeker
4d183e5567
Merge pull request #1765 from martin-frbg/issue1761
Do not use the new TLS-enabled memory allocator for non-threaded builds, and disable TLS by default in gmake as well
7 years ago
Martin Kroeker
34d55fd165
Merge pull request #1764 from yurivict/64-suffix
Allow to install the 'interface64' version concurrently with the regular version
7 years ago
Martin Kroeker
b991570210
Merge pull request #1762 from martin-frbg/issue1710-2
Add explicit casts to silence compiler warnings
7 years ago
Martin Kroeker
288aeea8a2
Fix default settings - USE_TLS and USE_SIMPLE_THREADED_LEVEL3 should both be off
7 years ago
Martin Kroeker
1ad1e79062
Catch inadvertent USE_TLS=0 declaration
for #1766
7 years ago
Martin Kroeker
b402626509
Do not use the new TLS code for non-threaded builds even if USE_TLS is set
Workaround for #1761 as that exposed a problem in the new code (which was intended to speed up multithreaded code only anyway).
7 years ago
Martin Kroeker
ec0cac1669
Merge pull request #4 from xianyi/develop
Update branch
7 years ago
Yuri
2349e15149
Allow to install the 'interfare64' version concurrently with the regular version
7 years ago
Martin Kroeker
f3c262156e
Add an explicit cast to silence a warning
for #1710
7 years ago
Martin Kroeker
30f5a69ab8
Add explicit cast to silence a warning
for #1710
7 years ago
Martin Kroeker
fd081a91e4
Merge pull request #1759 from martin-frbg/lapack283
Remove an unused variable from several LAPACKE 2stage_work functions
7 years ago
Martin Kroeker
094f8c3b57
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
7 years ago
Martin Kroeker
5cf090f516
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
7 years ago
Martin Kroeker
58363542e7
remove unused variable ldb_t
Copied from Reference-LAPACK PR283
7 years ago
Martin Kroeker
3abc22a5bf
Merge pull request #1757 from brada4/develop
fix small typo in strmm_ LN
7 years ago
Andrew
1e531701b7
fix small typo
7 years ago
Martin Kroeker
5d42b6ea04
Merge pull request #1756 from martin-frbg/issue1754
Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT
7 years ago
Martin Kroeker
ba4f433321
Merge pull request #1749 from martin-frbg/issue1531
Fix ARMV8 cross-compilation for IOS
7 years ago
Martin Kroeker
4cf7315a5d
Adjust ARMV8 SGEMM unrolling when using the C fallback kernel_2x2 for IOS
7 years ago
Martin Kroeker
b57af93792
just make CBLAS_LAYOUT an alias of the existing CBLAS_ORDER
to avoid having to change all instances of enum CBLAS_ORDER in this file
7 years ago
Martin Kroeker
8aeab0601e
Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT
fixes #1754
7 years ago
Martin Kroeker
1cb7b9015e
Conditional compilation of assembly files that IOS does not like
7 years ago
Martin Kroeker
a4bd41e9f2
Fix paths to C kernels for nrm2
7 years ago
Martin Kroeker
9e2bb0c641
Update with the changes from 0.3.3
7 years ago
Martin Kroeker
dbfd7524cd
Update version to 0.3.4.dev
7 years ago
Martin Kroeker
2982ce505d
Update version to 0.3.4.dev
7 years ago
Martin Kroeker
5bac15adbd
Merge pull request #1746 from martin-frbg/issue1674
Assume cross-compilation if host and target os differ
7 years ago
Martin Kroeker
e17f969fa0
Assume cross-compilation if host and target os differ
fixes 1674
7 years ago
Martin Kroeker
e11126b26a
Merge pull request #1745 from martin-frbg/issue1743
Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-gen…
7 years ago
Martin Kroeker
74608e470d
Merge pull request #1744 from martin-frbg/lapack272
Fix missing replacements of ILAENV by ILAENV_2STAGE (lapack PR 272)
7 years ago
Martin Kroeker
f3fd44a731
Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic
fixes #1743
7 years ago
Martin Kroeker
9e917b16db
Fix missing replacements of ILAENV by ILAENV_2STAGE (lapack PR 272)
This could cause spurious "parameter has an illegal value" errors in DSYEVR and related routines, see https://github.com/Reference-LAPACK/lapack/issues/262
7 years ago
Martin Kroeker
8440a4cb1a
Merge pull request #1742 from martin-frbg/interim033
Add combination of old and new thread memory code selectable by new option USE_TLS
7 years ago
Martin Kroeker
b55690a659
typo fix
7 years ago
Martin Kroeker
b902a40986
Rewrite glibc version check
7 years ago
Martin Kroeker
5991d1a6cd
Update memory.c
7 years ago
Martin Kroeker
b1b743f434
Merge branch 'develop' into interim033
7 years ago
Martin Kroeker
2caa2210bb
Add USE_TLS option to choose between old and new implementation of memory.c
7 years ago
Martin Kroeker
2a589c4b28
Add USE_TLS option to switch between old and new memory.c
7 years ago
Martin Kroeker
fd42ca462d
Combo of default pre-0.3.1 memory.c and band-aided version of PR1739
7 years ago
Martin Kroeker
52d3f7af50
Merge pull request #1738 from sharkcz/s390x
detect z14 arch on s390x
7 years ago
Dan Horák
5c6e020f49
detect z14 arch on s390x
7 years ago
Martin Kroeker
d4d3113adc
Merge pull request #1731 from fenrus75/readme
add short blurb about avx512 and needed compiler to README
7 years ago
Martin Kroeker
375dff54fc
Merge pull request #1733 from fenrus75/dsymv
Add an AVX512 enabled DSYMV (L) function
7 years ago
Martin Kroeker
a5f165275a
Merge pull request #1732 from fenrus75/dgemv
Add an AVX512 enabled DGEMV (n) function
7 years ago
Martin Kroeker
8c13aa495a
Merge pull request #1730 from fenrus75/fix-sdot
Fix typo in sdot function
7 years ago
Martin Kroeker
1ee6d087c3
Merge pull request #1729 from fenrus75/dscal
Add an AVX512 enabled DSCAL function
7 years ago
Martin Kroeker
a95a784ab2
Merge pull request #1723 from maamountki/develop
Disable zgemv scale in gemv benchmark by default
7 years ago
Arjan van de Ven
9bec34cb67
Add an AVX512 enabled DSYMV (L) function
written in C intrinsics for best readability.
(the same C code works for Haswell as well)
For logistical reasons the code falls back to the existing
haswell AVX2 implementation if the GCC or LLVM compiler is not new enough
7 years ago