Martin Kroeker
1abafcd9b2
handle corner cases involving NAN and/or INF
1 year ago
Martin Kroeker
ffc1ab3f6e
Test corner cases of all SCAL variants
1 year ago
Martin Kroeker
020b3e1682
fix handling of INF arguments
1 year ago
Martin Kroeker
56bd57ca99
Merge pull request #4720 from martin-frbg/issue3039
Resurrect and complete cblas_?gemm_batch
1 year ago
Martin Kroeker
6b564d53fd
Merge pull request #4727 from martin-frbg/issue4726
Fix another corner case of infinity handling in x86_64 ZSCAL
1 year ago
Martin Kroeker
db070a9223
add gemm_batch drivers
1 year ago
Martin Kroeker
076766df4e
Update CMakeLists.txt
1 year ago
Martin Kroeker
8c05765a5a
fix other corner cases where x=INF
1 year ago
Martin Kroeker
516743f7dc
fix other instances of mishandling INF
1 year ago
Martin Kroeker
9ff4e9714e
additional fixes for handling INF arguments
1 year ago
Martin Kroeker
ce130f11d2
Update zscal.c
1 year ago
Martin Kroeker
ab13cfef93
more fixes for infinite x
1 year ago
Martin Kroeker
a16f8249ba
add tests with the imaginary part of the array infinite
1 year ago
Martin Kroeker
ad2b5c67c8
fix another corner case involving infinity
1 year ago
Martin Kroeker
0d007adb18
fix clang_cl-flang job to use flang-new after the llvm update
1 year ago
Martin Kroeker
b9a1c9a06c
Merge pull request #4725 from Neumann-A/patch-1
Fix CMake warning
1 year ago
Martin Kroeker
ff6670cb83
don't generate non-cblas files for gemm_batch
1 year ago
Alexander Neumann
dd4505c5dd
Fix CMake warning
1 year ago
Martin Kroeker
362a063396
remove return value
1 year ago
Martin Kroeker
d0794f88dc
add gemm_batch driver
1 year ago
Martin Kroeker
833a8880c6
add cblas_?gemm_batch
1 year ago
Martin Kroeker
89c7bbcba6
add cblas_?gemm_batch
1 year ago
Martin Kroeker
103637887e
add cblas_?gemm_batch
1 year ago
Martin Kroeker
0073affe63
Merge pull request #4693 from goplanid/locks-improvement
Lock Management Improvements for Memory Allocation Efficiency
1 year ago
Martin Kroeker
834e633d79
Merge pull request #4718 from martin-frbg/issue4713
Override Intel icx's default fp-model to ensure correct handling of NaNs
1 year ago
Martin Kroeker
3833190454
Merge pull request #4716 from martin-frbg/lapack1018
Fix a potential bounds error in ?UNHR_COL/?ORHR_COL (Reference-LAPACK PR 1018)
1 year ago
Martin Kroeker
cf7e668fe8
Merge pull request #4709 from martin-frbg/docsbuildbranch
Don't try to deploy docs when PR-building in a fork
1 year ago
Martin Kroeker
8b4996a2d5
Override icx's default fast math mode to ensure correct NaN handling
1 year ago
Martin Kroeker
616cc28d82
Override icx's default fast math mode to ensure correct NaN handling
1 year ago
Martin Kroeker
772116879d
Merge pull request #4717 from bartoldeman/zscal-float-inf-fix
Replace use of FLT_MAX in x86_64 zscal.c by isinf()
1 year ago
Bart Oldeman
62f7b244ff
Replace use of FLT_MAX in x86_64 zscal.c by isinf()
Commit def4996 fixed issues with inf and nan values in zscal,
but used FLT_MAX, where DBL_MAX or isinf() is more appropriate,
as FLT_MAX is for single precision only.
Using FLT_MAX caused test case failures in the LAPACK tests.
isinf() is consistent with the later fix 969601a1
1 year ago
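The reasoning in the message of commit 62f7b244ff above can be illustrated with a small sketch. This is not the code from zscal.c; the helper names `exceeds_flt_max` and `is_infinite` are hypothetical and exist only to contrast the two kinds of check on a double-precision value. It shows that a comparison against FLT_MAX also fires for large but finite doubles, while isinf() does not.

```c
#include <float.h>
#include <math.h>

/* Hypothetical helper (not from OpenBLAS): a range check against FLT_MAX.
 * FLT_MAX is the largest finite *float* (about 3.4e38), so for a double
 * argument this is also true for many perfectly finite values. */
static int exceeds_flt_max(double x) {
    return fabs(x) > FLT_MAX;
}

/* Hypothetical helper (not from OpenBLAS): a precision-independent check.
 * isinf() is a C99 macro from <math.h> that works for float, double and
 * long double, and returns nonzero only for +/- infinity. */
static int is_infinite(double x) {
    return isinf(x) != 0;
}
```

For example, `exceeds_flt_max(1.0e300)` is true although 1.0e300 is a finite double, whereas `is_infinite(1.0e300)` is false; both helpers agree for `INFINITY`. A check against DBL_MAX would avoid the false positive for doubles, but isinf() has the advantage of not depending on the precision at all.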
Martin Kroeker
7ebbe3cc72
Fix potential bounds error (Reference-LAPACK PR 1018)
1 year ago
Martin Kroeker
791e015024
Fix potential bounds error (Reference-LAPACK PR 1018)
1 year ago
Martin Kroeker
4dd715d220
Fix potential bounds error (Reference-LAPACK PR 1018)
1 year ago
Martin Kroeker
e2c1a1e269
Fix potential bounds error (Reference-LAPACK PR 1018)
1 year ago
Martin Kroeker
172d91846f
Don't try to deploy docs in a fork
1 year ago
Martin Kroeker
700ea74a37
Merge pull request #4705 from martin-frbg/issue4703
Fix INTERFACE64 builds on Loongarch64
1 year ago
Martin Kroeker
aa259b141d
Merge pull request #4704 from amritahs-ibm/saxpy_perf_fix
Fix SAXPY regression when compiled with the OpenXL compiler
1 year ago
Martin Kroeker
25b34e67f9
Merge pull request #4678 from ev-br/codspeed
WIP: add codspeed benchmarks [skip cirrus]
1 year ago
Martin Kroeker
6494f432df
Fix INTERFACE64 builds on Loongarch64
1 year ago
Evgeni Burovski
81cf0db047
DOC: add a readme for benchmarks/pybench
1 year ago
Evgeni Burovski
9f28161837
BENCH: add benchmarks using codspeed.io
1 year ago
Martin Kroeker
5015548d18
Merge pull request #4700 from martin-frbg/fix4698
Remove spurious brace in cmake/system.cmake
1 year ago
Martin Kroeker
ce96e0e50f
Merge pull request #4699 from ChipKerchner/fixSwapVectorOrder
POWER: Fixing endianness issue in cswap/zswap kernel for AIX
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Chip Kerchner
3a1417671a
POWER: Fixing endianness issue in cswap/zswap kernel for AIX
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698)
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
39c96063fb
Merge pull request #4694 from martin-frbg/issue3660
Add a minimum problem size for multithreading in GBMV
1 year ago
Martin Kroeker
f5c080f083
Fix CMake syntax in kernel file parsing of IFNEQ conditionals (#4695)
* Fix syntax in parsing of IFNEQ
1 year ago
Martin Kroeker
9a2a6a2e52
Merge pull request #4696 from frjohnst/restore_second
Revert PRs 4515 and 4520 (restore second, dsecnd)
1 year ago