Martin Kroeker
fde8a8e6a0
Improve cmake build behaviour with non-host cpu targets ( #2246 )
1. Supply appropriate values for C/Z GEMM unroll when cross-compiling for CORE2 or ARMV7
2. Add the required xLOCAL_BUFFER_SIZE parameters for cross-compiling CORE2
3. Add -DFORCE_<target> option to getarch when building with -DTARGET=target
for #2245
6 years ago
Martin Kroeker
256fc15f5f
Merge pull request #2 from xianyi/develop
update
6 years ago
Martin Kroeker
ee498525e0
Merge pull request #2242 from martin-frbg/issue2235
Add arch data for cmake cross-compiling to CORE2
6 years ago
Martin Kroeker
1fec0570f6
Add cgemm and zgemm unroll factors for core2
6 years ago
Martin Kroeker
b5af7b9c78
Disable ppc64le test environment on Travis CI
as this semi-official beta option has suddenly reverted to a standard x86_64 environment causing spurious failures
6 years ago
Martin Kroeker
f3c314550c
Merge pull request #2243 from quickwritereader/develop
possible cgemv,caxpy,cdot fix
6 years ago
AbdelRauf
847c20c9b7
fix uninitialized variables i
6 years ago
AbdelRauf
4c22828812
caxpy and cdot are using vec_vsx_ld
6 years ago
AbdelRauf
e79712d969
cgemv using vec_vsx_ld instead of letting gcc to decide
6 years ago
AbdelRauf
be09551cdf
aligned
6 years ago
Martin Kroeker
ec1ef6aa9e
Merge pull request #2241 from martin-frbg/zdotfix
Make x86_64 zdot compile with PGI and Sun C again
6 years ago
Martin Kroeker
11c59acfb1
Keep both PGI/SUN and default code paths to avoid breaking Clang/WIndows
6 years ago
Martin Kroeker
bf0d92a310
Add arch data for cross-compiling to CORE2
for #2235
6 years ago
Martin Kroeker
db066151ee
Merge pull request #2240 from martin-frbg/issue2237
Fix PGI build options (again)
6 years ago
Martin Kroeker
3a55dca2dc
Make x86_64 zdot compile with PGI and Sun C again
broken by #2222 as CREAL,CIMAG do not expand to a valid lvalue with these compilers
6 years ago
Martin Kroeker
7d380f7d79
Fix PGI build options (again)
for #2237
6 years ago
Martin Kroeker
300f158d3b
Merge pull request #2239 from martin-frbg/issue2231
Fix 32bit armv8 compilation regression
6 years ago
Martin Kroeker
3635fdbf2b
Do not abuse the global ARCH variable as a local temporary
Setting it with a simple "uname -m" just to be able to decide whether to compile getarch.c with -march=native
may actually keep getarch from doing a proper probe. Fixes #2231 , a regression caused by #2110
6 years ago
Martin Kroeker
b6552b11eb
Merge pull request #2 from xianyi/develop
merge develop
6 years ago
Kavana Bhat
3dc6b26eff
AIX changes for Power8
6 years ago
Martin Kroeker
5fdf9ad24f
Merge pull request #2228 from martin-frbg/issue2227
Add Intel Goldmont Plus CPUID
6 years ago
Martin Kroeker
2fe967c542
Merge branch 'develop' into issue2227
6 years ago
Martin Kroeker
6d8595351c
Add Intel Goldmont Plus CPUID
fixes #2227
6 years ago
Martin Kroeker
f40200f559
Merge pull request #2223 from martin-frbg/getarch-pgi
Make getarch compile with PGI
6 years ago
Martin Kroeker
a95a5e52b8
Fix PGI compiler detection for getarch
6 years ago
Martin Kroeker
e3d846ab57
Do not use -march=native with the PGI compiler
6 years ago
Martin Kroeker
8506386d82
Merge pull request #1 from xianyi/develop
rebase
6 years ago
Martin Kroeker
9ef96b32a6
Add multithreading support to the x86_64 zdot kernel ( #2222 )
* Add multithreading support
copied from the ThunderX2T99 kernel. For #2221
6 years ago
Martin Kroeker
b48c025974
Merge pull request #2218 from martin-frbg/issue2215
Make the new DGEMM regression test properly depend on CBLAS and LAPACKE
6 years ago
Martin Kroeker
a1fce67743
Make the new DGEMM regression test properly depend on CBLAS and LAPACKE
fixes #2215
6 years ago
Martin Kroeker
103b32fdb7
Merge pull request #2216 from martin-frbg/issue2214
Remove case-sensitivity in x86 LSAME on (AMD) cpus without CMOV
6 years ago
Martin Kroeker
aef9804089
Fix unwanted case-sensitivity in x86 LSAME for (AMD) processors without CMOV
Problem was already noticed some years ago in #238 , but back then the problem was only corrected in one of the #ifdef branches.
Fixes #2214
6 years ago
Martin Kroeker
303869f572
Update with changes from 0.3.7
6 years ago
Martin Kroeker
02d9203981
Increment version to 0.3.8.dev
6 years ago
Martin Kroeker
7b6808b69c
Increment version to 0.3.8.dev
6 years ago
Martin Kroeker
5f36f18148
Update with 0.3.7 changes
6 years ago
Martin Kroeker
d47fe78b0e
Set version to 0.3.7
6 years ago
Martin Kroeker
ebe2f47a0f
Set version to 0.3.7
6 years ago
Martin Kroeker
20d417762f
Merge pull request #2213 from xianyi/develop
Update from develop in preparation of the 0.3.7 release
6 years ago
Martin Kroeker
321288597c
Merge pull request #2212 from martin-frbg/nofort-nolib
Avoid spurious dependency on the fortran runtime despite NOFORTRAN=1
6 years ago
Martin Kroeker
be147a9f28
Avoid adding a spurious dependency on the fortran runtime despite NOFORTRAN=1
for cases where a fortran compiler is present but not wanted (e.g. not fully functional)
6 years ago
Martin Kroeker
c275290ea6
Merge pull request #2211 from martin-frbg/arm64_gcc_trivial
Silence two nuisance warnings from gcc
6 years ago
Martin Kroeker
b7bbb02447
Silence two nuisance warnings from gcc
6 years ago
Martin Kroeker
bf1430f7d7
Merge pull request #2208 from martin-frbg/munmap-debug
Provide more information on mmap/munmap failure
6 years ago
Martin Kroeker
dccff2e785
Merge pull request #2206 from martin-frbg/zen-dtrmm
Replace vpermpd with vpermilpd in the Haswell DTRMM kernel
6 years ago
Martin Kroeker
5c3458a6e7
Merge pull request #2199 from martin-frbg/zen-dtrsm
Replace most vpermpd calls in the Haswell DTRSM_RN kernel
6 years ago
Martin Kroeker
1776ad82c0
Add files via upload
6 years ago
Martin Kroeker
4e2f81cfa1
Provide more information on mmap/munmap failure
for #2207
6 years ago
Martin Kroeker
acf6002ab2
Replace most vpermpd calls in the Haswell DTRSM_RN kernel
6 years ago
Martin Kroeker
96a794e9fd
Merge pull request #2198 from martin-frbg/icelake
Update CPUID recognition for Intel Ice Lake
6 years ago