Martin Kroeker
2fb65d062b
Update .drone.yml
4 years ago
Martin Kroeker
3bd81e9b91
Update .drone.yml
4 years ago
Martin Kroeker
0161aba5dc
Update .drone.yml
4 years ago
Martin Kroeker
3f021a1b7d
try to force installation of a specific version of gcc
4 years ago
Martin Kroeker
6667aa5bc8
Update .drone.yml
4 years ago
Martin Kroeker
3bdca029b2
Update .drone.yml
4 years ago
Martin Kroeker
b33002365f
Update .drone.yml
4 years ago
Martin Kroeker
ea48bbac6b
Update .drone.yml
4 years ago
Martin Kroeker
3cbbb3a37f
run blas-tester on ThunderX/Falkor
4 years ago
Martin Kroeker
a0c6350f41
Add OSX build job with Homebrew OpenMP in a CMAKE build
4 years ago
Martin Kroeker
435d84a7ce
Merge pull request #3332 from martin-frbg/travisbadge
Update Travis badge in README
4 years ago
Martin Kroeker
139f632ca4
Merge pull request #3334 from Guobing-Chen/BF16_gemm_full_kernel
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
4 years ago
Chen, Guobing
5d86becdae
Add all SBGEMM kernels for IA AVX512-BF16 based platforms
Added all SBGEMM kernels including NN/NT/TN/TT for both ColMajor and
RowMajor, based on AVX512-BF16 ISA set on IA.
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
4 years ago
Martin Kroeker
93c8bafff5
Update Travis badge in README
4 years ago
Martin Kroeker
b5858c4472
Merge pull request #3330 from xianyi/issue3321
Improve the "tried to allocate too many buffers" error message
4 years ago
Martin Kroeker
898212efcd
Actually add the message to the TLS section
4 years ago
Martin Kroeker
210a1584c5
Rebase source and edit TLS version of the message as well
4 years ago
Martin Kroeker
e6d6d3ee43
Merge pull request #3331 from gxw-loongson/develop
Fixed typos about LOONGARCH64
4 years ago
gxw
0b8f7c8c10
Add cmake support for LOONGARCH64
4 years ago
Martin Kroeker
f2a7a67f5a
Improve the "tried to allocate too many buffers" error message
4 years ago
Martin Kroeker
e0e88f9edc
Merge pull request #3329 from martin-frbg/issue3272
Work around gcc11+ miscompiling C/ZBLAS3 tests at -O3
4 years ago
Martin Kroeker
5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
e78fbe4654
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
cbc41973fd
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
gxw
34207bdf5b
Fixed typos about LOONGARCH64
4 years ago
Martin Kroeker
1b6db3dbba
Merge pull request #3327 from h-vetinari/lapack597_redux
Complete the carry of lapack PR 597
4 years ago
Martin Kroeker
f681553c6a
Merge pull request #3326 from wattoc/develop
Include Haiku in processor count checks
4 years ago
Martin Kroeker
afadeeba2a
Merge pull request #3325 from gxw-loongson/develop
Add support for LOONGARCH64
4 years ago
Isuru Fernando
02d4a49761
Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS
4 years ago
Craig Watson
4d7dfe4845
Include Haiku in processor count checks
4 years ago
gxw
af0a69f355
Add support for LOONGARCH64
4 years ago
Martin Kroeker
5a2fe5bfb9
Merge pull request #3323 from martin-frbg/issue3322
GCC did not support -mtune for ARM64 before 5.1
4 years ago
Martin Kroeker
342d3e8b5c
Merge pull request #3314 from martin-frbg/lapack597
Fix LAPACK testsuite compatibility with libomp (Reference-LAPACK PR 597)
4 years ago
Martin Kroeker
efbd7c7840
GCC did not support -mtune for ARM64 before 5.1
4 years ago
Martin Kroeker
3a7955cd93
Merge pull request #3320 from martin-frbg/issue3318
Empirical workaround for numpy SVD NaN problem from issue 3318
4 years ago
Martin Kroeker
47ba85f314
Fix regex to match kernels suffixed with cpuname too
4 years ago
Martin Kroeker
30f23be0f9
Rework setting of -mfma to only apply it where necessary
4 years ago
Martin Kroeker
49bbf330ca
Empirical workaround for numpy SVD NaN problem from issue 3318
4 years ago
Martin Kroeker
38d5b4b124
Update version to 0.3.17.dev
4 years ago
Martin Kroeker
6e3fbe8ac5
Update version to 0.3.17.dev
4 years ago
Martin Kroeker
86273392e5
Merge pull request #3317 from xianyi/release-0.3.0
merge 0.3.17 back into develop to copy tag
4 years ago
Martin Kroeker
d909f9f3d4
Update version to 0.3.17
4 years ago
Martin Kroeker
12d3d94e2e
Merge pull request #3316 from xianyi/develop
Merge develop for bugfix release 0.3.17
4 years ago
Martin Kroeker
f349be3bdb
Merge branch 'release-0.3.0' into develop
4 years ago
Martin Kroeker
4777eb678f
Update version to 0.3.17
4 years ago
Martin Kroeker
415876d117
Merge pull request #3315 from martin-frbg/changelog0317
Update Changelog for 0.3.17
4 years ago
Martin Kroeker
da8435dc36
Update Changelog for 0.3.17
4 years ago
Martin Kroeker
4c7065f3ee
Merge pull request #3313 from martin-frbg/3266-2
Remove BLASLONG casts from SPARC parameter entries
4 years ago
Martin Kroeker
f62bfaafe8
Merge pull request #3312 from martin-frbg/revert_3260
Temporarily disable the SkylakeX sgemv_t microkernel
4 years ago