Martin Kroeker
2a97ca615f
MSVC compatibility fixes
3 years ago
Martin Kroeker
5dec93e93b
Complete the C conversion of the xTRSYL3 files
3 years ago
Martin Kroeker
b6d74b7fff
Add f2c-converted files for the BLAS3-based Sylvester solver
3 years ago
Martin Kroeker
b2cc310470
Add f2c-converted versions of the new BLAS3-based Sylvester solver
3 years ago
Martin Kroeker
379efbe5af
Fix typos
3 years ago
Martin Kroeker
95da5141f0
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
2592853fc7
Restore OpenBLAS-specific changes
3 years ago
Martin Kroeker
52c2a0397b
Restore OpenBLAS modifications to link line
3 years ago
Martin Kroeker
bb652f65a3
Typo fix
3 years ago
Martin Kroeker
fb42a0cf8b
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
13f3bbece1
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
92174725d9
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
6eb707d941
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
7eb2653268
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
4bc918a791
Add a BLAS3-based triangular Sylvester equation solver (Reference-LAPACK PR 651)
3 years ago
Martin Kroeker
35dac5677a
Merge pull request #3816 from martin-frbg/lapack638
Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638)
3 years ago
Martin Kroeker
ee6643bc6b
Merge pull request #3815 from martin-frbg/lapack690
Fix workspace calculation in the left-looking variant of GEQRF (Reference-LAPACK PR690)
3 years ago
Martin Kroeker
3e2d52c502
Fix workspace calculation in GEQRF/GERQF (Reference-LAPACK PR 638)
3 years ago
Martin Kroeker
cb48c29b6f
Fix workspace calculation (Reference-LAPACK PR690)
3 years ago
Martin Kroeker
8c99d5d1b6
Merge pull request #3796 from martin-frbg/gemmt
Add a trivial GEMMT implementation based on a looped GEMV
3 years ago
Martin Kroeker
b53b0f6bb6
Merge pull request #3802 from martin-frbg/relafix
Fix cmake compilation of ReLAPACK and expose its INCLUDE_ALL option
3 years ago
Martin Kroeker
9a31faf420
Merge pull request #3811 from martin-frbg/issue3805
Improve gcc arch option selecting for Neoverse cpus
3 years ago
Martin Kroeker
e326ef9f0f
Merge pull request #3812 from bartoldeman/cscal-zscal-skylakex
Add [cz]scal microkernels for SKYLAKEX
3 years ago
Martin Kroeker
827a9c6079
Merge pull request #3814 from martin-frbg/traviswait-3
Travis Ci: Increase the wait time for ppc jobs again
3 years ago
Martin Kroeker
d141cf341f
Increase the wait time for ppc jobs again
3 years ago
Martin Kroeker
aad79ab516
Merge pull request #3813 from martin-frbg/azuredynosx
AzureCi: Limit cpu models in OSX_dynarch_cmake to keep it from running out of time
3 years ago
Martin Kroeker
09dd90ca09
Limit cpu models in OSX_dynarch_cmake
3 years ago
Martin Kroeker
f14435cb4b
Merge pull request #3810 from martin-frbg/fix3800
Add fallbacks to RaptorLake entry from PR3800
3 years ago
Bart Oldeman
6c1043eb41
Add [cz]scal microkernels for SKYLAKEX
These are as similar to dscal_microk_skylakex-2.c as possible
for consistency.
Note that before this change SKYLAKEX+ uses generic C functions for
cscal/zscal via commit 2271c350 from #2610 (which is masked by
commit 086d87a30 ). However now #3799 disables FMAs (in turn enabled
by `-march=skylake-avx512`) in the plain C code which fixes excessive
LAPACK test failures more nicely.
3 years ago
Martin Kroeker
be546ec1ad
Add gcc options for Neoverse cpus
3 years ago
Martin Kroeker
c957ad684e
Bump gcc requirement for NeoverseN2 and V1 to 10.4
3 years ago
Martin Kroeker
1865b15240
Add fallbacks to RaptorLake entry
3 years ago
Martin Kroeker
e6204d254f
Update CMakeLists.txt
3 years ago
Martin Kroeker
2e64722681
Update Makefile.rule
3 years ago
Martin Kroeker
aa2a2d9c01
Conditionally compile files that may get replaced by ReLAPACK
3 years ago
Martin Kroeker
1b77764182
Conditionally leave out bits of LAPACK to be overridden by ReLAPACK
3 years ago
Martin Kroeker
fcda11c1ae
Revert special handling of GEMMT
3 years ago
Martin Kroeker
4743d80c22
Merge pull request #3800 from thrasibule/raptorlake
add raptor lake ids
3 years ago
Martin Kroeker
5d02f2e83e
Merge pull request #3806 from martin-frbg/dyn_coop
Fix OPENBLAS_CORETYPE=COOPERLAKE not working in DYNAMIC_ARCH builds
3 years ago
Martin Kroeker
da6e426b13
fix Cooperlake not selectable via environment variable
3 years ago
Martin Kroeker
c970717157
fix missing t in xgemmt rule
Co-authored-by: Alexis <35051714+amontoison@users.noreply.github.com>
3 years ago
Martin Kroeker
62a44c9c5d
Merge pull request #3804 from martin-frbg/issue3803
Remove excess initializer (leftover from rework of PR 3793)
3 years ago
Martin Kroeker
c9d78dc3b2
Remove excess initializer (leftover from rework of PR 3793)
3 years ago
Martin Kroeker
65338a9493
Merge pull request #3799 from bartoldeman/cscal-zscal-no-fma
x86_64: prevent GCC and Clang from generating FMAs in cscal/zscal.
3 years ago
Martin Kroeker
ea6c5f3cf5
Add option RELAPACK_REPLACE
3 years ago
Martin Kroeker
d39978cd7f
Fix includes
3 years ago
Martin Kroeker
ce7ea72de1
Fix include paths
3 years ago
Martin Kroeker
3ebf5d219d
handle INCLUDE_ALL and optional function prefixes
3 years ago
Martin Kroeker
a082d54035
Rename to avoid conflict with OpenBLAS' toplevel config.h
3 years ago
Martin Kroeker
eeebaf2294
move INCLUDE_ALL to (c)make options
3 years ago