Martin Kroeker
25ce2e2a63
Update with 0.3.21 changes
3 years ago
Martin Kroeker
9f89b62b25
Merge pull request #3715 from martin-frbg/issue3648
Increase thresholds for STFSM and CTFSM in the LAPACK testsuite
3 years ago
Martin Kroeker
9c4e91a77d
Increase threshold
3 years ago
Martin Kroeker
1fe06caf49
Increase threshold
3 years ago
Martin Kroeker
ff58e9a7f1
Merge pull request #3609 from martin-frbg/lapack3101
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
3 years ago
Martin Kroeker
f6a1854ce9
resync gensymbol with develop
3 years ago
Martin Kroeker
2bee490287
Merge pull request #3714 from martin-frbg/crosscmake
Add more x86_64 target definitions for CMAKE cross-compiling
3 years ago
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers (#3712)
* Add support for the Cray Fortran compiler
3 years ago
Martin Kroeker
3784b3d45c
Add more x86_64 target definitions for cross-compiling
3 years ago
Martin Kroeker
096ae6f2bd
Merge pull request #3709 from nursik/develop
Add TCORE Generic
3 years ago
Martin Kroeker
19fefd100e
Merge pull request #3703 from martin-frbg/omp_adaptive
Add env variable OMP_ADAPTIVE to control OMP threadpool behaviour
3 years ago
Martin Kroeker
2e51a61914
Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1
Correct indentation of the for and if statements in dgemv_thread_safety.cpp
3 years ago
Nursultan Zarlyk
a7ac252fd9
Add TCORE Generic in prebuild.cmake
During cross-compilation for ARMv8 on an x64 host with MSVC, the
build fails because there are no define directives for the Generic core.
3 years ago
Martin Kroeker
648a69a67e
Merge pull request #3707 from martin-frbg/getarch_risc
Fix crash in RISCV autodetection when pmodel is not present in /proc/cpuinfo
3 years ago
Martin Kroeker
ef9c976a94
Really fix compilation; fix crash when pmodel is not present in cpuinfo
3 years ago
Martin Kroeker
f727235be4
Merge pull request #3706 from martin-frbg/czifunding
Acknowledge past CZI EOSS 1/EOSS 3 funding
3 years ago
Martin Kroeker
880bc1d1db
Acknowledge past CZI EOSS 1/EOSS 3 funding
3 years ago
Martin Kroeker
d0ba257de0
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
LoongArch64: Add DYNAMIC_ARCH support
3 years ago
Martin Kroeker
78da6a750a
Merge pull request #3705 from RajalakshmiSR/bf16ppc
POWER: Enable bfloat16 kernels by default
3 years ago
Rajalakshmi Srinivasaraghavan
1d97405c02
POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
3 years ago
gxw
fbfe1daf6e
LoongArch64: Add DYNAMIC_ARCH support
3 years ago
Martin Kroeker
80cdfed7b2
Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size
3 years ago
Martin Kroeker
08e3754b39
Add environment variable OMP_ADAPTIVE
3 years ago
Martin Kroeker
047a279f09
Merge pull request #3702 from martin-frbg/issue3687
Add openblas_getaffinity() extension (Linux-only)
3 years ago
Martin Kroeker
30473b6a9d
add openblas_getaffinity()
3 years ago
Martin Kroeker
8668571040
add openblas_getaffinity()
3 years ago
Martin Kroeker
daca01622b
fix detection of Neoverse V1 and user-enforced selection of N2 in ARM64 DYNAMIC_ARCH (#3700)
* fix detection of Neoverse V1 and user-enforced selection of N2
3 years ago
Martin Kroeker
c322aab685
Merge pull request #3684 from imzhuhl/neoversen2_dynamic_arch
Neoverse N2: DYNAMIC_ARCH
3 years ago
Martin Kroeker
cf796aee8c
Merge pull request #3699 from martin-frbg/issue3692
Add c_check recognition of Fujitsu fcc for Fugaku A64FX
3 years ago
Martin Kroeker
28d40ba60b
Merge pull request #3696 from XiWeiGu/loongson2k1000
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
3 years ago
Martin Kroeker
692848d20c
typo fix
3 years ago
Martin Kroeker
76ea7739dd
Merge pull request #3698 from martin-frbg/issue3697
utest needs to be linked against libm on QNX as well
3 years ago
Martin Kroeker
f8c5bdfbab
Treat Fujitsu fcc on Fugaku like clang
3 years ago
Martin Kroeker
70001e1e9e
Add Fujitsu compiler
3 years ago
Martin Kroeker
cf37182260
Add Fujitsu compiler (fcc)
3 years ago
Martin Kroeker
68d86ea150
Add Fujitsu compiler
3 years ago
Martin Kroeker
7aaa0ce0e8
utest needs to be linked against libm on QNX as well
3 years ago
Martin Kroeker
cd8e57040c
Merge pull request #3691 from martin-frbg/issue3679-sparc
SPARC: fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
gxw
3573306a69
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
3 years ago
Martin Kroeker
a4303ae378
Merge pull request #3695 from martin-frbg/ppc6nrm2
PPC6: Fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Martin Kroeker
31377d04f0
Merge pull request #3694 from martin-frbg/traviswait
Add back travis_wait to keep ppc jobs from getting cancelled
3 years ago
Martin Kroeker
6c118b7977
Fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Martin Kroeker
b60415a347
Add back travis_wait to keep ppc jobs from getting cancelled
3 years ago
Mayank Raj
a9939111d7
Update dgemv_thread_safety.cpp
3 years ago
Martin Kroeker
c43ec53bdd
Merge pull request #3690 from RajalakshmiSR/cdotp10
POWER: Fix complex dot function failures
3 years ago
Martin Kroeker
b7c65d08cb
Merge pull request #3689 from RajalakshmiSR/dgemvgcc10
POWER10: dgemv builtin rename
3 years ago
Martin Kroeker
fcbbd8c25c
Merge pull request #3682 from XiWeiGu/develop
Fix dnrm2_tiny testcase failure
3 years ago
Martin Kroeker
06ef015234
fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Rajalakshmi Srinivasaraghavan
a612e78a97
POWER: Fix complex dot function failures
Some tests of the complex dot functions fail when compiling with gcc12.
The machine constraints currently used do not update all four elements of the
expected result array. This is fixed by reducing the optimization level.
The change does not affect performance numbers; the code will be converted to C in the future.
3 years ago
Rajalakshmi Srinivasaraghavan
432fd99445
POWER10: dgemv builtin rename
Add a check to use the correct builtin name with older versions
of the gcc10 compiler.
3 years ago