Martin Kroeker
|
f0f9e26090
|
fix typedef of logical to support INTERFACE64
|
1 year ago |
Martin Kroeker
|
547533fe9a
|
fix typedef of logical to support INTERFACE64
|
1 year ago |
Martin Kroeker
|
7bc0ff77bd
|
fix typedef of logical to support INTERFACE64
|
1 year ago |
Martin Kroeker
|
16a597927f
|
fix typedef of logical to support INTERFACE64
|
1 year ago |
Martin Kroeker
|
d277c6d15b
|
Merge pull request #4585 from martin-frbg/issue1881
Cap the number of parallel threads for GEMM;GETRF and POTRF to ensure sensible workloads on big systems
|
1 year ago |
Martin Kroeker
|
e1b618c6cf
|
Merge pull request #4598 from jip/fix
fix dtrtrs_ and ztrtrs_ to accept case-insensitive uplo and diag parameters
|
1 year ago |
Igor Zhuravlov
|
22d305e2df
|
fix dtrtrs_ and ztrtrs_ to accept case-insensitive parameters uplo and diag
Changes to be committed:
modified: interface/lapack/trtrs.c
modified: interface/lapack/ztrtrs.c
|
1 year ago |
Martin Kroeker
|
a87713f14b
|
Merge pull request #4597 from martin-frbg/issue4581
Add support for the ARM Cortex-A76 cpu
|
1 year ago |
Martin Kroeker
|
584e87661d
|
set SWITCH_RATIO for Cortex-A76
|
1 year ago |
Martin Kroeker
|
4718d97eb9
|
Mention support for Cortex-A76
|
1 year ago |
Martin Kroeker
|
b925f61fb0
|
Add support for Cortex-A76
|
1 year ago |
Martin Kroeker
|
3af736fb9d
|
Add support for Cortex-A76
|
1 year ago |
Martin Kroeker
|
441c81026e
|
Add support for Cortex-A76
|
1 year ago |
Martin Kroeker
|
58659730a4
|
Merge pull request #4595 from martin-frbg/issue4583-2
Temporarily revert S/DNRM2 on NeoverseN1 and Apple M to the older NEON kernel
|
1 year ago |
Martin Kroeker
|
9ead81bd39
|
Revert S/DNRM2 to the base NEON kernel to fix precision loss
|
1 year ago |
Martin Kroeker
|
29995b2969
|
Merge pull request #4594 from mattip/openblas.pc.il
fix typo
|
1 year ago |
Matti Picus
|
5b33e648b4
|
fix typo
|
1 year ago |
Martin Kroeker
|
b1e8ba5017
|
Merge pull request #4587 from mseminatore/win_queue_fix
Address Windows thread server re-entrant queue bug #4582
|
1 year ago |
Martin Kroeker
|
8267fcfda8
|
Merge pull request #4588 from XiWeiGu/loongarch_fixed_dzamax
loongarch: Fixed dzamax
|
1 year ago |
Mark Seminatore
|
b0ad8a78ff
|
code to fix lost work in case of re-entrant calls to exec_blas_async()
|
1 year ago |
Martin Kroeker
|
e1638ea43a
|
Merge pull request #4586 from martin-frbg/potrf-para
use atomic acq/rel operations in potrf_parallel as in the corresponding getrf_parallel
|
1 year ago |
Martin Kroeker
|
2dda40d280
|
use atomic operations as in the corresponding getrf
|
1 year ago |
Martin Kroeker
|
68ab5185d0
|
Update potrf.c
|
1 year ago |
Martin Kroeker
|
19b29b3448
|
Update getrf.c
|
1 year ago |
Martin Kroeker
|
a3354a7630
|
Cap the number of parallel threads
|
1 year ago |
Martin Kroeker
|
5da4c93ef2
|
Cap the number of parallel threads
|
1 year ago |
Martin Kroeker
|
496106642f
|
Cap the number of parallel threads
|
1 year ago |
gxw
|
96607cbb98
|
loongarch: Fixed dzamax
Initialize the registers to prevent sporadic errors.
|
1 year ago |
Martin Kroeker
|
9af2a9dc3b
|
Merge pull request #4579 from ChipKerchner/fixInializerPriority
Fix global (static) constructor priority so that OpenBLAS gets initialized before other libraries. Other unit test AIX fix.
|
1 year ago |
Chip Kerchner
|
0e0d0bce1a
|
Fix global (static) constructor priorty so that OpenBLAS gets initialized before other libraries. Other unit test AIX fix.
|
1 year ago |
Martin Kroeker
|
4059a75c9c
|
Merge pull request #4578 from jerryz123/patch-1
Fix README formatting error
|
1 year ago |
Jerry Zhao
|
0b814ab8b9
|
Fix README formatting error
|
1 year ago |
Martin Kroeker
|
87f83ebe9c
|
Merge pull request #4575 from martin-frbg/fixup4503
Restore outer loop of blas_buffer_inuse setup for parallel OpenMP
|
1 year ago |
Martin Kroeker
|
88b5330ae7
|
Restore outer loop of blas_buffer_inuse setup
|
1 year ago |
Martin Kroeker
|
52b71a1673
|
Filter out FFLAGS that flang-new from LLVM18 no longer supports (#4569)
* Filter out FFLAGS that flang-new from LLVM18 no longer supports
|
1 year ago |
Martin Kroeker
|
3d2a9e4a61
|
Merge pull request #4567 from RajalakshmiSR/P9AIX
POWER9: Use default param values from POWER8 on AIX
|
1 year ago |
Martin Kroeker
|
9ad9b52884
|
Merge pull request #4568 from martin-frbg/fixbenchloops
Fix bad assignment of OPENBLAS_LOOPS variable in several benchmarks
|
1 year ago |
Martin Kroeker
|
3f1ec74fe7
|
Fix OPENBLAS_LOOPS assignment
|
1 year ago |
Martin Kroeker
|
fe39c891a6
|
Fix OPENBLAS_LOOPS assignment
|
1 year ago |
Martin Kroeker
|
ffcbaca167
|
Fix OPENBLAS_LOOPS assignment
|
1 year ago |
Martin Kroeker
|
05d0438c25
|
Fix OPENBLAS_LOOPS assignment
|
1 year ago |
Rajalakshmi Srinivasaraghavan
|
f5b2a877e2
|
POWER9: Use default param values from POWER8 on AIX
AIX uses KERNEL.POWER8 optimization on POWER9 and changing
the default GEMM parameters in param.h to use POWER8 values
on POWER9.
|
1 year ago |
Martin Kroeker
|
b4a1153648
|
Merge pull request #4566 from XiWeiGu/fix_loongarch_lsx
LoongArch: Fixed LSX opt
|
1 year ago |
gxw
|
50869f6ca8
|
loongarch: Fixed zrot LSX opt
|
1 year ago |
gxw
|
b5eb9d6bac
|
loongarch: Fixed {sc/dz}amax LSX opt
|
1 year ago |
gxw
|
ad13e04669
|
loongarch: Fixed {s/d/sc/dz}amin LSX opt
|
1 year ago |
gxw
|
bbf82cb624
|
loongarch: Fixed {s/d}axpby LSX opt
|
1 year ago |
gxw
|
ac460eb42a
|
loongarch: Fixed i{c/z}amin LSX opt
|
1 year ago |
Martin Kroeker
|
56d114b245
|
Merge pull request #4565 from martin-frbg/issue4564
Fix argument lists of RELAPACK_?gemmt for good
|
1 year ago |
Martin Kroeker
|
2e9ce9bb07
|
Fix argument lists of RELAPACK_?gemmt for good
|
1 year ago |