Martin Kroeker
f2d6b1c70e
Add multithreading threshold
2 years ago
Martin Kroeker
a495ffc554
Rework multithreading threshold
2 years ago
Martin Kroeker
1659750861
Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959)
* Add ARMV5
2 years ago
Martin Kroeker
7f56c31e21
Merge pull request #3958 from tttapa/develop
Add ARMv6 support to cmake/prebuild.cmake
2 years ago
Martin Kroeker
cf80350b89
Merge pull request #3957 from martin-frbg/issue3954
Fix detection of Cray CCE after recent detection changes for gfortran on Cray
2 years ago
Pieter Pas
a79ae116e2
Add ARMV6 to cmake/prebuild.cmake,
based on cpuid_arm.c and param.h
2 years ago
Martin Kroeker
280c8469d4
Fix detection of Cray CCE after detection changes for gfortran on Cray
CCE15 no longer produces a .ident line, so look for HPE name in a note section as well
2 years ago
Martin Kroeker
c68afeb567
Merge pull request #3956 from martin-frbg/issue3955
Update Reference-LAPACK README to 3.11.0
2 years ago
Martin Kroeker
c2b01c6f81
Update to 3.11.0
2 years ago
Martin Kroeker
02920be53a
Merge pull request #3953 from martin-frbg/issue3801
Do not use multithreading for small SYMV/SYR2K workloads
2 years ago
Martin Kroeker
244147495a
Do not use multithreading for small workloads
2 years ago
Martin Kroeker
2885e4d898
Merge pull request #3951 from martin-frbg/issue3940
Do not change the global OMP_NUM_THREADS
2 years ago
Martin Kroeker
e09e494f25
Merge pull request #3950 from martin-frbg/issue3856-syr
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
2 years ago
Martin Kroeker
dac2d7aef0
Merge pull request #3952 from martin-frbg/cleanup3936
Avoid unused variable warning in non-OpenMP builds
2 years ago
Martin Kroeker
743ae5ab18
move declaration of openmp_nthreads under the related ifdef
2 years ago
Martin Kroeker
579bc86671
remove call to omp_set_num_threads
2 years ago
Martin Kroeker
ab32f832a8
fix stray blank on continuation line
2 years ago
Martin Kroeker
e359787e28
restore C/Z SPMV, SPR, SYR,SYMV
2 years ago
Martin Kroeker
6ca6eb7613
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
2 years ago
Martin Kroeker
2b77f28810
Merge pull request #3949 from martin-frbg/lapack750
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750)
2 years ago
Martin Kroeker
164eafd61d
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750)
2 years ago
Martin Kroeker
871b730dc5
Create SECURITY.md
2 years ago
Martin Kroeker
7719dbecde
Merge pull request #3946 from martin-frbg/lapack682
Rewrite ?LAQR5 and S/DHGEQZ, add tests for TREVC3 (Reference-LAPACK PR 682)
2 years ago
Martin Kroeker
147e2fbf87
Add numerical tests for TREVC3 (Reference-LAPACK 682)
2 years ago
Martin Kroeker
2a83ec1f79
Rewrite to use FMA with Householder reflectors
2 years ago
Martin Kroeker
e1c3c34178
Standardize coding style (Reference-LAPACK 682)
2 years ago
Martin Kroeker
f7b9391119
Merge pull request #3945 from ahojnnes/user/joschonb/fix-prebuild-path
Wrap variables in cmake/prebuild in quotes to handle spaces in paths
2 years ago
Johannes Schönberger
fbfe474583
f
2 years ago
Johannes Schönberger
ec4700087e
Wrap variables in cmake/prebuild in quotes to handle spaces in paths
2 years ago
Martin Kroeker
e8db1fe89b
Merge pull request #3943 from martin-frbg/llvm15
Add clang option to avoid running out of registers in AVX512 assembly
2 years ago
Martin Kroeker
de937b3194
Add clang option to avoid running out of registers in AVX512 assembly
2 years ago
Martin Kroeker
f3d21039ce
Improve fix from PR3924 (#3941)
* compare denominator against DBL_MIN rather than a somewhat arbitrary small number near it
2 years ago
Martin Kroeker
8d6813ec41
Merge pull request #3938 from martin-frbg/issue3937
Fix CMAKE-based cross-compilation to CortexA53 (wrong DGEMM setting)
2 years ago
Martin Kroeker
19d6795122
Merge pull request #3936 from martin-frbg/issue3933
Observe any stricter (OpenMP) thread count limit imposed by openblas_set_num_threads()
2 years ago
Martin Kroeker
50c263716e
Correct DGEMM_UNROLL_M value for A53 in cross-compile
2 years ago
Martin Kroeker
e298d613fa
initialize status variable for openblas_set_num_threads
2 years ago
Martin Kroeker
05aa88268f
add status variable for openblas_set_num_threads
2 years ago
Martin Kroeker
9f9d0012a3
observe thread limit imposed by openblas_set_num_threads()
2 years ago
Martin Kroeker
fe5d3ca8e0
Merge pull request #3935 from martin-frbg/omp_place_num
Fix OpenMP thread counting returning places rather than cores
2 years ago
Martin Kroeker
e38ab079a0
Fix OpenMP thread counting returning places rather than cores
2 years ago
Martin Kroeker
9feaaa3f39
Merge pull request #3932 from martin-frbg/issue3931
Handle unrecognized ASM compiler (from Arm Compiler 22.1) in CMAKE builds
2 years ago
Martin Kroeker
8272dfc552
Handle unrecognized ASM compiler (from Arm Compiler 22.1)
2 years ago
Martin Kroeker
f616c86404
Merge pull request #3930 from sergei-lewis/dot-kernel-early-bail
dot.c early bail fix
2 years ago
Sergei Lewis
cb0a70e0e2
dot.c early bail fix
2 years ago
Martin Kroeker
5925178d03
Merge pull request #3924 from martin-frbg/numpy22025
Avoid overflow from division in GETF2 potentially causing NaN
3 years ago
Zhang Xianyi
f58080278f
Merge pull request #3923 from xctan/fix-cmake-riscv64
Add missing RISC-V architecture in arch.cmake
3 years ago
Martin Kroeker
3d27cbd9a3
avoid overflow in division
3 years ago
Martin Kroeker
a39ced0551
avoid overflow in division
3 years ago
xctan
6a0de3aa39
Add missing RISC-V architecture in arch.cmake
RISC-V support exists in Makefile.system but is missing in arch.cmake. This patch adds riscv64 platform support to the CMake build system, just as 039e27545f/Makefile.system (L830-L832) did.
3 years ago
Martin Kroeker
039e27545f
Merge pull request #3915 from martin-frbg/issue3910
Fix DYNAMIC_ARCH builds that select only a subset of precisions
3 years ago