Martin Kroeker
76ea7739dd
Merge pull request #3698 from martin-frbg/issue3697
utest needs to be linked against libm on QNX as well
3 years ago
Martin Kroeker
7aaa0ce0e8
utest needs to be linked against libm on QNX as well
3 years ago
Martin Kroeker
cd8e57040c
Merge pull request #3691 from martin-frbg/issue3679-sparc
SPARC: fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Martin Kroeker
a4303ae378
Merge pull request #3695 from martin-frbg/ppc6nrm2
PPC6: Fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Martin Kroeker
31377d04f0
Merge pull request #3694 from martin-frbg/traviswait
Add back travis_wait to keep ppc jobs from getting cancelled
3 years ago
Martin Kroeker
6c118b7977
Fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Martin Kroeker
b60415a347
Add back travis_wait to keep ppc jobs from getting cancelled
3 years ago
Martin Kroeker
c43ec53bdd
Merge pull request #3690 from RajalakshmiSR/cdotp10
POWER: Fix complex dot function failures
3 years ago
Martin Kroeker
b7c65d08cb
Merge pull request #3689 from RajalakshmiSR/dgemvgcc10
POWER10: dgemv builtin rename
3 years ago
Martin Kroeker
fcbbd8c25c
Merge pull request #3682 from XiWeiGu/develop
Fix dnrm2_tiny testcase failure
3 years ago
Martin Kroeker
06ef015234
fix DNRM2 returning INF instead of zero due to intermediate overflow
3 years ago
Rajalakshmi Srinivasaraghavan
a612e78a97
POWER: Fix complex dot function failures
There are some test failures in complex dot functions when compiling with gcc12.
The machine constraints used now do not update all the four elements in the
expected result array. Fixing this with a reduced level of optimization.
This is not changing any performance numbers but will be converted to C code in future.
3 years ago
Rajalakshmi Srinivasaraghavan
432fd99445
POWER10: dgemv builtin rename
Add check to use correct builtin name for older versions
of gcc10 compilers.
3 years ago
gxw
4dd05e526b
LoongArch64: Fix dnrm2_tiny testcase failure
3 years ago
Martin Kroeker
7da799dc66
Merge pull request #3686 from martin-frbg/issue3685
Fix Fortran-less CTEST build option
3 years ago
Martin Kroeker
6e018b84c4
Fix function prototypes and INTERFACE64 support
3 years ago
Martin Kroeker
ccd87cc472
Fix switching between Fortran and C build
3 years ago
gxw
cce4b1d956
MIPS64: Fix dnrm2_tiny testcase failure
3 years ago
Martin Kroeker
7918ba11c2
Merge pull request #3680 from martin-frbg/issue3636-2
Guard against sysconf(__SC_NPROCESSORS_CONF) returning zero at runtime
3 years ago
Martin Kroeker
69148ae795
Guard against sysconf returning zero processors
3 years ago
Martin Kroeker
e9260f5451
Guard against system call returning zero processors
3 years ago
Martin Kroeker
4cfd6f110a
Merge pull request #3678 from martin-frbg/issue3677
Eliminate uses of CREAL on left-hand side of assignments
3 years ago
Martin Kroeker
e12d474780
Eliminate uses of CREAL on left-hand side of assignments
3 years ago
Martin Kroeker
686e6d7c10
Merge pull request #3676 from martin-frbg/dnrm2-utest
Add DNRM2 regression test for issues 2998 and 3654
3 years ago
Martin Kroeker
c5041ae270
properly embed test_dnrm2
3 years ago
Martin Kroeker
8e6f719ad3
use huge_val not huge_valf for portability
3 years ago
Martin Kroeker
af88494f87
old systems may not have inf in math.h
3 years ago
Martin Kroeker
ee41b6eb24
Add DNRM2 regression test for issues 2998 and 3654
3 years ago
Martin Kroeker
bf8998a9f4
Merge pull request #3675 from martin-frbg/issue3654
workaround ThunderX2 DNRM2 fault with ssq=inf,scale=0
3 years ago
Martin Kroeker
9e29598575
workaround fault with ssq=inf,scale=0
3 years ago
Martin Kroeker
3df3d622eb
Merge pull request #3672 from imzhuhl/neoversen2_bf16
sbgemm support for ARM Neoverse N2
3 years ago
Martin Kroeker
407a1a242c
Merge pull request #3670 from martin-frbg/osxvermin
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
3 years ago
Honglin Zhu
ec0d5c7a2a
Add gfortran parameters
3 years ago
Honglin Zhu
123e0dfb62
Neoverse N2 sbgemm:
1. Modify the algorithm to resolve multithreading failures
2. No memory allocation in sbgemm kernel
3. Optimize when alpha == 1.0f
3 years ago
Honglin Zhu
bc3728475f
format code
3 years ago
Honglin Zhu
55d686d41e
neoverse n2 sbgemm:
implement ncopy tcopy kernel_8x4
3 years ago
Honglin Zhu
04593bb27c
neoverse n2 sbgemm: init file
3 years ago
Martin Kroeker
1fb4259077
Merge pull request #3673 from martin-frbg/azuredynmingw
AzureCI: drop cpus from the DYNAMIC_LIST for Windows/mingw to save time
3 years ago
Martin Kroeker
47a0e53196
mingw-dynamic arch: drop Haswell too
3 years ago
Martin Kroeker
c7b3ce010e
drop NEHALEM from the DYNLIST for Windows/mingw to save time
3 years ago
Martin Kroeker
be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel
POWER: fix issues with the small matrix kernel
3 years ago
Martin Kroeker
92275a7902
Merge pull request #3642 from nursik/develop
Add ARM64 support for Windows
3 years ago
Martin Kroeker
914c4d0fe8
Add C versions of the CBLAS test sources ( #3656 )
* Add C conversions of the CBLAS tests for NOFORTRAN=1 builds
* Enable CTEST without Fortran and fix passing of BUILD_vartype options to exports/gensymbol
3 years ago
Martin Kroeker
2857987ff6
Increase MACOSX_DEPLOYMENT_TARGET to 11 on ARM macs
3 years ago
VFerrari
2062280c6f
Power: Enable SMALL_MATRIX OPT as default for dynamic arch
3 years ago
VFerrari
cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0
This patch fixes an issue when OpenBLAS is compiled for TARGET=POWER10
and the flag USE_THREAD is set to 0.
The function `num_cpu_avail` is only available when USE_THREAD=1,
so SMP is defined.
3 years ago
Martin Kroeker
9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10
POWER10: Fix ZGEMM testcase failures
3 years ago
Martin Kroeker
9777c59d98
Merge pull request #3653 from RajalakshmiSR/dgemvp10
POWER10: convert dgemv inline assembly
3 years ago
Rajalakshmi Srinivasaraghavan
f191bc652b
POWER10: Fix ZGEMM testcase failures
This patch fixes storing and restoring non volatile registers
in zgemm POWER10 kernel.
3 years ago
Martin Kroeker
7060ca5002
Merge pull request #3647 from martin-frbg/exports_3.10.0
Amend gensymbol with some LAPACK 3.10.0 additions
3 years ago