Martin Kroeker
f0f9e26090
fix typedef of logical to support INTERFACE64
2 years ago
Martin Kroeker
547533fe9a
fix typedef of logical to support INTERFACE64
2 years ago
Martin Kroeker
7bc0ff77bd
fix typedef of logical to support INTERFACE64
2 years ago
Martin Kroeker
16a597927f
fix typedef of logical to support INTERFACE64
2 years ago
Martin Kroeker
d277c6d15b
Merge pull request #4585 from martin-frbg/issue1881
Cap the number of parallel threads for GEMM, GETRF and POTRF to ensure sensible workloads on big systems
2 years ago
Martin Kroeker
e1b618c6cf
Merge pull request #4598 from jip/fix
fix dtrtrs_ and ztrtrs_ to accept case-insensitive uplo and diag parameters
2 years ago
Igor Zhuravlov
22d305e2df
fix dtrtrs_ and ztrtrs_ to accept case-insensitive parameters uplo and diag
Changes to be committed:
modified: interface/lapack/trtrs.c
modified: interface/lapack/ztrtrs.c
2 years ago
Martin Kroeker
a87713f14b
Merge pull request #4597 from martin-frbg/issue4581
Add support for the ARM Cortex-A76 cpu
2 years ago
Martin Kroeker
584e87661d
set SWITCH_RATIO for Cortex-A76
2 years ago
Martin Kroeker
4718d97eb9
Mention support for Cortex-A76
2 years ago
Martin Kroeker
b925f61fb0
Add support for Cortex-A76
2 years ago
Martin Kroeker
3af736fb9d
Add support for Cortex-A76
2 years ago
Martin Kroeker
441c81026e
Add support for Cortex-A76
2 years ago
Martin Kroeker
58659730a4
Merge pull request #4595 from martin-frbg/issue4583-2
Temporarily revert S/DNRM2 on NeoverseN1 and Apple M to the older NEON kernel
2 years ago
Martin Kroeker
9ead81bd39
Revert S/DNRM2 to the base NEON kernel to fix precision loss
2 years ago
Martin Kroeker
29995b2969
Merge pull request #4594 from mattip/openblas.pc.il
fix typo
2 years ago
shivammonaka
7102367fde
Introduced callback to Pthread, Win32 and OpenMP backend
2 years ago
Matti Picus
5b33e648b4
fix typo
2 years ago
Martin Kroeker
b1e8ba5017
Merge pull request #4587 from mseminatore/win_queue_fix
Address Windows thread server re-entrant queue bug #4582
2 years ago
Martin Kroeker
8267fcfda8
Merge pull request #4588 from XiWeiGu/loongarch_fixed_dzamax
loongarch: Fixed dzamax
2 years ago
Mark Seminatore
b0ad8a78ff
code to fix lost work in case of re-entrant calls to exec_blas_async()
2 years ago
Martin Kroeker
e1638ea43a
Merge pull request #4586 from martin-frbg/potrf-para
use atomic acq/rel operations in potrf_parallel as in the corresponding getrf_parallel
2 years ago
Martin Kroeker
2dda40d280
use atomic operations as in the corresponding getrf
2 years ago
Martin Kroeker
68ab5185d0
Update potrf.c
2 years ago
Martin Kroeker
19b29b3448
Update getrf.c
2 years ago
Martin Kroeker
a3354a7630
Cap the number of parallel threads
2 years ago
Martin Kroeker
5da4c93ef2
Cap the number of parallel threads
2 years ago
Martin Kroeker
496106642f
Cap the number of parallel threads
2 years ago
gxw
96607cbb98
loongarch: Fixed dzamax
Initialize the registers to prevent sporadic errors.
2 years ago
Martin Kroeker
9af2a9dc3b
Merge pull request #4579 from ChipKerchner/fixInializerPriority
Fix global (static) constructor priority so that OpenBLAS gets initialized before other libraries. Other unit test AIX fix.
2 years ago
Chip Kerchner
0e0d0bce1a
Fix global (static) constructor priority so that OpenBLAS gets initialized before other libraries. Other unit test AIX fix.
2 years ago
Martin Kroeker
4059a75c9c
Merge pull request #4578 from jerryz123/patch-1
Fix README formatting error
2 years ago
Jerry Zhao
0b814ab8b9
Fix README formatting error
2 years ago
Martin Kroeker
87f83ebe9c
Merge pull request #4575 from martin-frbg/fixup4503
Restore outer loop of blas_buffer_inuse setup for parallel OpenMP
2 years ago
Martin Kroeker
88b5330ae7
Restore outer loop of blas_buffer_inuse setup
2 years ago
Martin Kroeker
52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports ( #4569 )
2 years ago
Martin Kroeker
3d2a9e4a61
Merge pull request #4567 from RajalakshmiSR/P9AIX
POWER9: Use default param values from POWER8 on AIX
2 years ago
Martin Kroeker
9ad9b52884
Merge pull request #4568 from martin-frbg/fixbenchloops
Fix bad assignment of OPENBLAS_LOOPS variable in several benchmarks
2 years ago
Martin Kroeker
3f1ec74fe7
Fix OPENBLAS_LOOPS assignment
2 years ago
Martin Kroeker
fe39c891a6
Fix OPENBLAS_LOOPS assignment
2 years ago
Martin Kroeker
ffcbaca167
Fix OPENBLAS_LOOPS assignment
2 years ago
Martin Kroeker
05d0438c25
Fix OPENBLAS_LOOPS assignment
2 years ago
Rajalakshmi Srinivasaraghavan
f5b2a877e2
POWER9: Use default param values from POWER8 on AIX
AIX uses the KERNEL.POWER8 optimizations on POWER9, so change
the default GEMM parameters in param.h to use the POWER8 values
on POWER9.
2 years ago
Matthew Barber
32c31ec82a
Build docs to `docs/` in GitHub pages
2 years ago
Matthew Barber
45cbabcf58
Improve `index.md`
* Fix typo
* Improve platforms sentence
2 years ago
Matthew Barber
51a465a162
Mailing list doc updates
2 years ago
Matthew Barber
7a8b1d40a2
Address review comments
* Update backers link to OpenBLAS repo
* Remove reference to Bountysource
2 years ago
Matthew Barber
27841b8b44
More doc improvements
* Fold "Users of OpenBLAS", "Publications", "Mailing list", "Donation" pages into "About"
* Remove full license
* Remove outdated TODO and machine list pages
* Shorter titles for CI and distributing pages
* Link to GotoBLAS readme in repo
* Brief description of OpenBLAS in homepage
* Move acknowledgements into About page
* Remove manual TOC in FAQ docs
2 years ago
Matthew Barber
9cfcee131e
Doc improvements squish
* Fix more broken links
* Move FAQ last
* Fold all install-y content into `install.md`
* Fix header levels
* Fold `download.md` contents into `install.md`
* Fix cross-references
* Ordered navigation
* Add logo to docs
* Make doc pages more consistent
2 years ago
Martin Kroeker
b4a1153648
Merge pull request #4566 from XiWeiGu/fix_loongarch_lsx
LoongArch: Fixed LSX opt
2 years ago