Martin Kroeker | 5fd871d7ea | Introduce a lower limit for multithreading | 1 year ago
Martin Kroeker | 6ca9ffa7f5 | Merge pull request #4655 from yamazakimitsufumi/update_2d_thread_distribution: Expanding the scope of 2D thread distribution to improve multi-threaded DGEMM performance | 1 year ago
Martin Kroeker | b45a78c6e9 | fix zdotu argument passing in utest_ext on windows (#4691): fix passing of results on windows | 1 year ago
Martin Kroeker | 1ab9f50561 | Merge pull request #4690 from mattip/blasint: use blasint instead of int to quiet warnings | 1 year ago
Matti Picus | 243640c354 | use blasint instead of int to quiet warnings | 1 year ago
Martin Kroeker | f0560f906f | Merge pull request #4689 from martin-frbg/issue4684: Fix compilation of the BLAS extension utests for NO_CBLAS=1 | 1 year ago
Martin Kroeker | e1e0d9a2ae | Merge pull request #4688 from XiWeiGu/loongarch64_fixed_gcc14_compilation: loongarch64: Fixed GCC14 compilation issue | 1 year ago
Martin Kroeker | d8baf2f2ea | Support compilation without CBLAS | 1 year ago
Martin Kroeker | a6c184d150 | forward NO_CFLAGS to the CFLAGS, if set | 1 year ago
gxw | ecf8b588a9 | loongarch64: Fixed GCC14 compilation issue | 1 year ago
Martin Kroeker | 8da6f7e5f2 | Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6: Loongarch64: Improving the Performance and Stability of dgemm | 1 year ago
gxw | f9a26240a7 | loongarch64: Fixed icamax_lsx | 1 year ago
gxw | cb0f707409 | loongarch64: Fixed utest fork:safety | 1 year ago
gxw | 637c650f4f | loongarch64: Add buffer offset for target LOONGSON3R5 | 1 year ago
Martin Kroeker | 5d678f1831 | Merge pull request #4685 from martin-frbg/issue4660-2: Fix builds for LOONGARCH64 in LSX mode | 1 year ago
Martin Kroeker | b45d8e1ab2 | remove stray comma | 1 year ago
Martin Kroeker | 5500b4ab26 | Merge pull request #4680 from theAeon/develop: Expose whether locking is enabled in get_config | 1 year ago
gxw | 6017ad7146 | loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6 | 1 year ago
Martin Kroeker | d66aa63478 | Merge pull request #4681 from martin-frbg/fix4662-2: fix HUGETLB allocation for TLS mode as well | 1 year ago
Martin Kroeker | f0f1ff7820 | fix HUGETLB allocation for TLS mode as well | 1 year ago
Andrew Robbins | edfe1aa471 | Expose whether locking is enabled in get_config | 1 year ago
Martin Kroeker | edeb5259a1 | Merge pull request #4679 from martin-frbg/fix4662: Restore Loongson LA64ARCH handling | 1 year ago
Martin Kroeker | 4376b6f7d2 | Restore Loongson LA64ARCH handling | 1 year ago
Martin Kroeker | 8735b54fa8 | Merge pull request #4662 from martin-frbg/hugetlb-doc: Fix and document the two HUGETLB options for buffer allocation in Makefile.rule | 1 year ago
Martin Kroeker | fc10673fd3 | Merge branch 'develop' into hugetlb-doc | 1 year ago
Martin Kroeker | c20189cc82 | Merge pull request #4677 from martin-frbg/issue4676: Add autodetection of Intel Meteor Lake and Emerald Rapids | 1 year ago
Martin Kroeker | bbd227ce4a | Add Intel Meteor Lake and Emerald Rapids | 1 year ago
Martin Kroeker | f034745ce6 | Merge pull request #4675 from martin-frbg/issue4619: Mention LD_LIBRARY_PATH in user documentation | 1 year ago
Martin Kroeker | a82ecadc11 | mention LD_LIBRARY_PATH | 1 year ago
Martin Kroeker | b859f6f191 | Merge pull request #4617 from cyk2018/patch-1: [Doc]Update user_manual.md for static linker | 1 year ago
Martin Kroeker | dc99b61380 | sort unwanted interdependencies of alloc_shm and alloc_hugetlb | 1 year ago
Martin Kroeker | 9c4e10fbd1 | sort hugetlb and shm alloc options | 1 year ago
Martin Kroeker | a63d71129c | Merge pull request #4671 from martin-frbg/issue4668: Silence a GCC14 warning/error in the f2c-converted LAPACK | 1 year ago
Martin Kroeker | 3d26837a35 | Suppress GCC14 error exit in the f2c-converted LAPACK | 1 year ago
Martin Kroeker | 7c915e64ca | Silence a GCC14 warning/error in the f2c-converted LAPACK | 1 year ago
Martin Kroeker | edacf9b397 | Work around spurious BLAS3 test errors on LOONGSON3R3/4 (#4667): Force compilation with gfortran to use O0 on older Loongson hardware to avoid spurious test failures | 1 year ago
Martin Kroeker | 89e3fd0821 | Merge pull request #4666 from martin-frbg/issue4633: Fix spurious errors in the extended utest for INTERFACE64=1 on big-endian systems | 1 year ago
Martin Kroeker | b1d722fc0c | Fix cast to work with INTERFACE64 (especially on big-endian) | 1 year ago
Martin Kroeker | 1031d161f6 | Merge pull request #4663 from ayappanec/develop: Fix openblas_utest_ext build in AIX | 1 year ago
Ayappan P | f4ee0a423b | Fix openblas_utest_ext build in AIX | 1 year ago
Martin Kroeker | faf7b3d1bb | Document the two HUGETLB options for buffer allocation | 1 year ago
Martin Kroeker | ab5882ebf0 | Merge pull request #4661 from martin-frbg/issue4660: Fix CMAKE builds for Loongarch64 | 1 year ago
Martin Kroeker | 69aa93e34f | Fix Loongson compiler flag check | 1 year ago
Martin Kroeker | 015042f7b5 | Fix Loongson compiler flag test | 1 year ago
Martin Kroeker | 992b71fea2 | remove stray comma | 1 year ago
Martin Kroeker | d421dec278 | Merge pull request #4656 from zboszor/fix-x86-64-build-v2: Add forgotten conditional uses of PREFETCH | 1 year ago
Martin Kroeker | ae695d4ca0 | Merge pull request #4642 from XiWeiGu/loongarch64_clang: CI: Add clang test for loongarch64 | 1 year ago
gxw | 1cdad09760 | CI: Add clang test for loongarch64 | 1 year ago
gxw | 7cd438a5ac | loongarch64: Fixed clang compilation issues | 1 year ago
Martin Kroeker | 35d84ad012 | Merge pull request #4658 from mattip/remove-extra-suffix: do not add LIBNAMESUFFIX to dylib | 1 year ago