Martin Kroeker
a6a8cc2b7f
Fix errors in cpu enumeration with glibc 2.6
for #2114
6 years ago
Martin Kroeker
91943b7325
Merge pull request #2096 from martin-frbg/eig-testing
Avoid out-of-bounds accesses in LAPACK EIG tests
6 years ago
Martin Kroeker
268c28db7d
Merge pull request #2095 from martin-frbg/trsm
Correct length of name string in xerbla call
6 years ago
Martin Kroeker
2aad88d5b9
Avoid out-of-bounds accesses in LAPACK EIG tests
see https://github.com/Reference-LAPACK/lapack/issues/333
6 years ago
Martin Kroeker
0bd956fd21
Correct length of name string in xerbla call
6 years ago
Martin Kroeker
bbd9d98664
Merge pull request #2094 from martin-frbg/issue2066
Fix ReLAPACK integration problems
6 years ago
Martin Kroeker
798c448b0c
Add support for INTERFACE64 and fix XERBLA calls
1. Replaced all instances of "int" with "blasint"
2. Added string length as "hidden" third parameter in calls to fortran XERBLA
6 years ago
Martin Kroeker
9a19616a28
Support INTERFACE64=1
6 years ago
Martin Kroeker
6b41eb9c0c
Merge pull request #2092 from jeffbaylor/snprintf_with_MSC_VER
snprintf define consolidated to common.h
6 years ago
Martin Kroeker
ccfb7ead15
Merge pull request #2072 from martin-frbg/sum
Add (C)BLAS extension ?sum
6 years ago
Jeff Baylor
40e53e52d6
snprintf define consolidated to common.h
6 years ago
Martin Kroeker
744779d335
Merge pull request #2084 from RashmicaG/develop
Add in runtime CPU detection for POWER.
6 years ago
Rashmica Gupta
bcdf1d4917
Add in runtime CPU detection for POWER.
6 years ago
Martin Kroeker
e06b8438b4
Merge pull request #2080 from martin-frbg/issue2075
Add -lm and disable EXPRECISION support on *BSD
6 years ago
Martin Kroeker
9229d6859b
Add -lm and disable EXPRECISION support on *BSD
fixes #2075
6 years ago
Martin Kroeker
21d146a8de
Add declarations for ?sum
6 years ago
Martin Kroeker
7f4e36d219
Merge pull request #2073 from martin-frbg/issue2056-2
Detect 32bit environment on 64bit ARM hardware
6 years ago
Martin Kroeker
c04a729081
Add ?sum definitions for generic kernel
6 years ago
Martin Kroeker
100d94f94e
Add ?sum
6 years ago
Martin Kroeker
d17da6c6a4
Add cmake defaults for ?sum kernels
6 years ago
Martin Kroeker
1679de5e59
Detect 32bit environment on 64bit ARM hardware
for #2056 , using same approach as #2058
6 years ago
Martin Kroeker
246ca29679
Add ZARCH implementation of ?sum
as trivial copies of the respective ?asum kernels with the ABS and vflpsb calls removed
6 years ago
Martin Kroeker
9d717cb5ee
Add x86_64 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
6 years ago
Martin Kroeker
e3bc83f2a8
Add x86 implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
6 years ago
Martin Kroeker
70f2a4e0d7
Add SPARC implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmov to preserve code structure
6 years ago
Martin Kroeker
706dfe263b
Add POWER implementation of ?sum
as trivial copy of ?asum with the fabs replaced by fmr to preserve code structure
6 years ago
Martin Kroeker
688fa9201c
Add MIPS64 implementation of ?sum
as trivial copy of ?asum with the fabs replaced by mov to preserve code structure
6 years ago
Martin Kroeker
cdbe0f0235
Add MIPS implementation of ?sum
as trivial copy of ?asum with the fabs calls removed
6 years ago
Martin Kroeker
f8b82bc6dc
Add ia64 implementation of ?sum
as trivial copy of asum with the fabs calls removed
6 years ago
Martin Kroeker
3e3ccb9011
Add ARM64 implementations of ?sum
as trivial copies of the respective ?asum kernels with the fabs calls removed
6 years ago
Martin Kroeker
94ab4e6fb2
Add ARM implementations of ?sum
(trivial copies of the respective ?asum with the fabs calls removed)
6 years ago
Martin Kroeker
c3cfc6986b
Add implementations of ssum/dsum and csum/zsum
as trivial copies of asum/zsasum with the fabs calls replaced by fmov to preserve code structure
6 years ago
Martin Kroeker
b9f4943a14
Add ?sum
6 years ago
Martin Kroeker
79cfc24a62
Add interface for ?sum (derived from ?asum)
6 years ago
Martin Kroeker
5c42287c4f
Add declarations for ?sum and cblas_?sum
6 years ago
Martin Kroeker
32c7063cb0
Merge pull request #2061 from martin-frbg/martin-frbg-patch-1
Disable the AVX512 DGEMM kernel (again)
6 years ago
Martin Kroeker
c19a449096
Merge pull request #2071 from martin-frbg/issue2068
Provide CBLAS interfaces to I?MIN and I?MAX
6 years ago
Martin Kroeker
3d1e36d4cb
Build CBLAS interfaces for I?MIN and I?MAX
6 years ago
Martin Kroeker
4f9d3e4b28
Expose CBLAS interfaces for I?MIN and I?MAX
6 years ago
Martin Kroeker
4dec151d0b
Merge pull request #2070 from quickwritereader/develop
power9 makefile. dgemm based on power8 kernel with following changes …
6 years ago
Martin Kroeker
7c51cc8527
Merge branch 'develop' into develop
6 years ago
AbdelRauf
853a18bc17
power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself
6 years ago
Martin Kroeker
3ae122e2c7
Merge pull request #2069 from aixoss/aix-asm-change
AIX asm syntax changes needed for shared object creation
6 years ago
Ayappan P
b043a5962e
AIX asm syntax changes needed for shared object creation
6 years ago
Martin Kroeker
8502030e5e
Merge pull request #2064 from embray/cygwin/use-tls-thread-memory-cleanup
Fix for #2063
6 years ago
Erik M. Bray
8ba9e2a61a
Also call CloseHandle on each thread, as well as on the event so as to not leak thread handles.
6 years ago
Erik M. Bray
4ad694eda1
Fix for #2063 : The DllMain used in Cygwin did not run the thread memory
pool cleanup upon THREAD_DETACH which is needed when compiled with
USE_TLS=1.
6 years ago
Martin Kroeker
dff4a197a5
Merge pull request #2058 from xsacha/patch-3
Change 64-bit detection as explained in #2056
6 years ago
Martin Kroeker
a5425575b1
Merge pull request #2060 from embray/cygwin/readenv
Use POSIX getenv on Cygwin
6 years ago
Erik M. Bray
1006ff8a7b
Use POSIX getenv on Cygwin
The Windows-native GetEnvironmentVariable cannot be relied on, as
Cygwin does not always copy environment variables set through Cygwin
to the Windows environment block, particularly after fork().
6 years ago