Martin Kroeker | 41fcfd2984 | Update avx512_check | 4 years ago
Martin Kroeker | afba0665ad | Standalone AVX512 compatibility test to inform building getarch | 4 years ago
Martin Kroeker | 18eb8fba3e | Split off AVX512 test from c_check to make its result available to getarch | 4 years ago
Martin Kroeker | deb4cd1e02 | remove test code | 4 years ago
Martin Kroeker | 8f10a5f7ae | Revert AVX512 capability check from PR #1980 | 4 years ago
Martin Kroeker | e7f5d3c894 | Utilize compiler AVX512 capability info from c_check when building getarch | 4 years ago
Martin Kroeker | 501bf31e3e | Merge pull request #3567 from cenewcombe/develop: Fix unsafe read of Y in zsymv_L_sse2.S | 4 years ago
Caroline Newcombe | 5cc1111383 | fix unsafe read of Y in assembly kernel | 4 years ago
Martin Kroeker | 8d5a9c2f98 | Merge pull request #3565 from jonaszhou1/develop: Support Zhaoxin/Centaur kh40000 as ZEN | 4 years ago
Martin Kroeker | 9dcd8aeb7a | Merge pull request #3566 from martin-frbg/configtls: Report USE_TLS in get_config output if set | 4 years ago
Martin Kroeker | bf4642eb7e | Report USE_TLS if set | 4 years ago
JonasZhou | 2d0ad89b0d | Support Zhaoxin/Centaur kh40000 as ZEN; Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com> | 4 years ago
Martin Kroeker | 1ef97c470c | Merge pull request #3550 from guowangy/smatrix-mask-fix: Small Matrix: use proper inline asm input constraint for AVX512 mask | 4 years ago
Wangyang Guo | 225683218c | Small Matrix: use proper inline asm input constraint for AVX512 mask | 4 years ago
Martin Kroeker | 10b0428b2c | Merge pull request #3549 from martin-frbg/issue3543: Annotate LAPACKE_lsame with attribute const for GCC(+compatible) | 4 years ago
Martin Kroeker | 9c3e0bf319 | Merge pull request #3548 from martin-frbg/rela-gemmt: Enable the ?GEMMT functions in ReLAPACK | 4 years ago
Martin Kroeker | 1c1ffb0591 | Annotate LAPACKE_lsame with the const attribute for GCC and compatible compilers | 4 years ago
Martin Kroeker | 4058f32492 | Fix xGEMMT argument lists | 4 years ago
Martin Kroeker | 35d5105922 | Enable xGEMMT functions | 4 years ago
Martin Kroeker | ab304cca69 | Merge pull request #3547 from martin-frbg/issue3540-2: More build fixes for CooperLake with BFLOAT16 and DYNAMIC_ARCH | 4 years ago
Martin Kroeker | 9c626e466e | really fix definition of SHUFFLE_MAGIC_NO | 4 years ago
Martin Kroeker | 0698212c8c | Remove stray $ | 4 years ago
Martin Kroeker | 9d7429406f | Declare SHUFFLE_MAGIC_NO as const to placate clang | 4 years ago
Martin Kroeker | d9894f45d3 | Define sbgemm_r to fix DYNAMIC_ARCH builds | 4 years ago
Martin Kroeker | 522f809825 | Merge pull request #3542 from martin-frbg/issue3540: Fix compilation for CooperLake on Windows/clang | 4 years ago
Martin Kroeker | d50287fa5b | Merge pull request #3544 from giordano/mg/gcc6: Fix compilation of Skylake AVX512 kernels with GCC 6 | 4 years ago
Mosè Giordano | abbc947edb | Fix compilation of Skylake AVX512 kernels with GCC 6 | 4 years ago
Martin Kroeker | f2f0e1287b | Merge pull request #3541 from martin-frbg/issue3530: Fix compilation for SkylakeX with gcc 6.x | 4 years ago
Martin Kroeker | c62f8e2c01 | Prevent compiler attempts to use k0 as mask register | 4 years ago
Martin Kroeker | 80eb581c83 | Fix non-portable u_int64_t | 4 years ago
Martin Kroeker | 73ffabe6ba | Guard uses of _mm512_reduce_add_p? | 4 years ago
Martin Kroeker | 5ad66f0e96 | Merge pull request #3537 from xianyi/release-0.3.0: Merge back from 0.3.20 release to copy tag | 4 years ago
Martin Kroeker | 0b678b19dc | Update version to 0.3.20 | 4 years ago
Martin Kroeker | 15ff556862 | Merge pull request #3536 from xianyi/develop: Update from develop for release 0.3.20 | 4 years ago
Martin Kroeker | 1564b632ad | Merge branch 'release-0.3.0' into develop | 4 years ago
Martin Kroeker | dec53e0ca2 | Update version to 0.3.20 | 4 years ago
Martin Kroeker | c3f8de7923 | Merge pull request #3535 from martin-frbg/0320changes: Update with 0.3.20 changes | 4 years ago
Martin Kroeker | c352ac0ae3 | Update with 0.3.20 changes | 4 years ago
Martin Kroeker | 77433af83e | Merge pull request #3532 from martin-frbg/issue3528-2: Fix building a shared library on Mac with flang-classic | 4 years ago
Martin Kroeker | db7a03dd4c | keep flang-classic on MacOS from trying to create an executable instead of a library | 4 years ago
Martin Kroeker | 0e04710099 | filter out libflangmain as well | 4 years ago
Martin Kroeker | dc80925c92 | Merge pull request #3531 from martin-frbg/issue2973: Add .NOTPARALLEL: to MATGEN Makefile as a workaround for builds on DFS | 4 years ago
Martin Kroeker | e2bf3f31a6 | Add .NOTPARALLEL: as a workaround for builds on DFS | 4 years ago
Martin Kroeker | 92d243fee3 | Merge pull request #3527 from martin-frbg/issue3490: Treat AVX512-enabled Alder Lake like Cooper Lake/Sapphire Rapids | 4 years ago
Martin Kroeker | fa3e9f25e6 | Support AVX512-enabled Alder Lake | 4 years ago
Martin Kroeker | f7e8f9ec57 | Support AVX512-enabled AlderLake | 4 years ago
Martin Kroeker | 7656aba00e | Merge pull request #3493 from martin-frbg/casts+cleanup: WIP casts and cleanups | 4 years ago
Martin Kroeker | aec32e5bd4 | Update azure-pipelines.yml | 4 years ago
Martin Kroeker | 3007ca6371 | Merge pull request #3524 from martin-frbg/lapack646: Fix input argument check in ?GEQRT2 (from Reference-LAPACK PR 646) | 4 years ago
Martin Kroeker | a3eea3e127 | Fix input argument check (LAPACK PR 646) | 4 years ago