Martin Kroeker
34ecd967a5
Merge pull request #3580 from martin-frbg/dynx86_sbgemm
Remove extraneous (and wrong) definition of sbgemm_r on x86_64
3 years ago
Martin Kroeker
2519c9d93f
Merge pull request #3579 from martin-frbg/issue3557-2
Fix malfunctioning AVX512 check
3 years ago
Martin Kroeker
40302558ed
Remove extraneous (and wrong) definition of sbgemm_r on x86_64
3 years ago
Martin Kroeker
b79b99d695
Merge branch 'xianyi:develop' into issue3557-2
3 years ago
Martin Kroeker
c87a4dbf35
Fix checks for AVX512 and atomics
3 years ago
Martin Kroeker
93a81856ae
Revert AVX512 capability check from PR #1980 (moved to build)
3 years ago
Martin Kroeker
9fbeb88fb8
Utilize compiler AVX512 capability info from c_check when building getarch
3 years ago
Martin Kroeker
4cb302a596
Merge pull request #3561 from AlessioZanga/patch-msvc
Remove MSVC limitation
3 years ago
Martin Kroeker
f67977a323
Merge pull request #3576 from martin-frbg/cmaketestbom
Skip BLAS tests if Windows powershell added a BOM
3 years ago
Martin Kroeker
0ee2d15fdb
Merge pull request #3577 from martin-frbg/azure_win2022
Update Windows jobs in Azure CI to use Windows2022
3 years ago
Martin Kroeker
a0e86adf93
Update Windows jobs in Azure CI to use Windows2022
3 years ago
Martin Kroeker
2408315d10
Skip tests if Windows powershell added a BOM
3 years ago
Martin Kroeker
694f6c5c8d
Merge pull request #3574 from AdamNiederer/fix-dynamic-list-compilation
Fix broken elif in dynamic.c
3 years ago
Adam Niederer
69f2ac4ea2
Fix broken elif in dynamic.c
This fixes compilation in the following case:
$(MAKE) USE_OPENMP=1 USE_THREAD=1 NO_LAPACK=0 DYNAMIC_ARCH=1 \
DYNAMIC_LIST="HASWELL SKYLAKEX ATOM COOPERLAKE SAPPHIRERAPIDS ZEN"
3 years ago
Martin Kroeker
501bf31e3e
Merge pull request #3567 from cenewcombe/develop
Fix unsafe read of Y in zsymv_L_sse2.S
3 years ago
Caroline Newcombe
5cc1111383
fix unsafe read of Y in assembly kernel
3 years ago
Martin Kroeker
8d5a9c2f98
Merge pull request #3565 from jonaszhou1/develop
Support Zhaoxin/Centaur kh40000 as ZEN
3 years ago
Martin Kroeker
9dcd8aeb7a
Merge pull request #3566 from martin-frbg/configtls
Report USE_TLS in get_config output if set
3 years ago
Martin Kroeker
bf4642eb7e
Report USE_TLS if set
3 years ago
JonasZhou
2d0ad89b0d
Support Zhaoxin/Centaur kh40000 as ZEN
Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
3 years ago
AlessioZanga
ed2871cb71
Change `BUILD_WITHOUT_LAPACK` to `OFF` by default
3 years ago
Alessio Zanga
ed8c028f7f
Remove MSVC limitation
3 years ago
Martin Kroeker
1ef97c470c
Merge pull request #3550 from guowangy/smatrix-mask-fix
Small Matrix: use proper inline asm input constraint for AVX512 mask
3 years ago
Wangyang Guo
225683218c
Small Matrix: use proper inline asm input constraint for AVX512 mask
3 years ago
Martin Kroeker
10b0428b2c
Merge pull request #3549 from martin-frbg/issue3543
Annotate LAPACKE_lsame with attribute const for GCC(+compatible)
3 years ago
Martin Kroeker
9c3e0bf319
Merge pull request #3548 from martin-frbg/rela-gemmt
Enable the ?GEMMT functions in ReLAPACK
3 years ago
Martin Kroeker
1c1ffb0591
Annotate LAPACKE_lsame with the const attribute for GCC and compatible compilers
3 years ago
Martin Kroeker
4058f32492
Fix xGEMMT argument lists
3 years ago
Martin Kroeker
35d5105922
Enable xGEMMT functions
3 years ago
Martin Kroeker
ab304cca69
Merge pull request #3547 from martin-frbg/issue3540-2
More build fixes for CooperLake with BFLOAT16 and DYNAMIC_ARCH
3 years ago
Martin Kroeker
9c626e466e
really fix definition of SHUFFLE_MAGIC_NO
3 years ago
Martin Kroeker
0698212c8c
Remove stray $
3 years ago
Martin Kroeker
9d7429406f
Declare SHUFFLE_MAGIC_NO as const to placate clang
3 years ago
Martin Kroeker
d9894f45d3
Define sbgemm_r to fix DYNAMIC_ARCH builds
3 years ago
Martin Kroeker
522f809825
Merge pull request #3542 from martin-frbg/issue3540
Fix compilation for CooperLake on Windows/clang
3 years ago
Martin Kroeker
d50287fa5b
Merge pull request #3544 from giordano/mg/gcc6
Fix compilation of Skylake AVX512 kernels with GCC 6
3 years ago
Mosè Giordano
abbc947edb
Fix compilation of Skylake AVX512 kernels with GCC 6
3 years ago
Martin Kroeker
f2f0e1287b
Merge pull request #3541 from martin-frbg/issue3530
Fix compilation for SkylakeX with gcc 6.x
3 years ago
Martin Kroeker
c62f8e2c01
Prevent compiler attempts to use k0 as mask register
3 years ago
Martin Kroeker
80eb581c83
Fix non-portable u_int64_t
3 years ago
Martin Kroeker
73ffabe6ba
Guard uses of _mm512_reduce_add_p?
3 years ago
Martin Kroeker
5ad66f0e96
Merge pull request #3537 from xianyi/release-0.3.0
Merge back from 0.3.20 release to copy tag
3 years ago
Martin Kroeker
0b678b19dc
Update version to 0.3.20
3 years ago
Martin Kroeker
15ff556862
Merge pull request #3536 from xianyi/develop
Update from develop for release 0.3.20
3 years ago
Martin Kroeker
1564b632ad
Merge branch 'release-0.3.0' into develop
3 years ago
Martin Kroeker
dec53e0ca2
Update version to 0.3.20
3 years ago
Martin Kroeker
c3f8de7923
Merge pull request #3535 from martin-frbg/0320changes
Update with 0.3.20 changes
3 years ago
Martin Kroeker
c352ac0ae3
Update with 0.3.20 changes
3 years ago
Martin Kroeker
77433af83e
Merge pull request #3532 from martin-frbg/issue3528-2
Fix building a shared library on Mac with flang-classic
4 years ago
Martin Kroeker
db7a03dd4c
keep flang-classic on MacOS from trying to create an executable instead of a library
4 years ago