Martin Kroeker
e08bc5de89
Update .cirrus.yml
3 years ago
Martin Kroeker
125c52925b
Update .cirrus.yml
3 years ago
Martin Kroeker
d6a7809504
Merge branch 'xianyi:develop' into cirrusjobs
3 years ago
Martin Kroeker
30a0ccbd14
Merge pull request #4014 from martin-frbg/issue4013
Generally disable gcc's tree-vectorizer in x86_64 SGEMV,SSYMV,ZGEMV,C/ZDOT
3 years ago
Martin Kroeker
c9174ae8d7
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
c2fe9cb91f
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
66b39b835c
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
bb6d6735bf
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
d18efaed20
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
99f6d31ed5
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
7de9335c56
Disable gcc's tree-vectorizer pass on all operating systems
3 years ago
Martin Kroeker
93df4cfc82
Update .cirrus.yml
3 years ago
Martin Kroeker
ed5e4caf71
Update .cirrus.yml
3 years ago
Martin Kroeker
413bc21368
Update .cirrus.yml
3 years ago
Martin Kroeker
f3f8edb365
Update .cirrus.yml
3 years ago
Martin Kroeker
0980554be6
Update .cirrus.yml
3 years ago
Martin Kroeker
58ade09815
Update .cirrus.yml
3 years ago
Martin Kroeker
aded7be904
Update .cirrus.yml
3 years ago
Martin Kroeker
437c0bf2b4
Merge pull request #3843 from Mousius/switch-ratio
Propagate SWITCH_RATIO to DYNAMIC_ARCH builds
3 years ago
Martin Kroeker
c628030669
Merge pull request #3855 from Mousius/more-switch-ratio-tuning
SWITCH_RATIO for Arm(R) Neoverse(TM) architecture
3 years ago
Martin Kroeker
efcf71255a
Merge pull request #4003 from martin-frbg/issue3995
Fix instabilities in CGEMM/CTRMM/DNRM2 on Apple M1/M2 under OSX
3 years ago
Martin Kroeker
51dd1339e7
Merge pull request #4010 from martin-frbg/issue3989-2
Remove any stray trailing dash from CROSS_SUFFIX
3 years ago
Martin Kroeker
aa321e99f3
Update .cirrus.yml
3 years ago
Martin Kroeker
9e6cf3540f
Update .cirrus.yml
3 years ago
Martin Kroeker
d43993dd70
Update .cirrus.yml
3 years ago
Martin Kroeker
479509bb37
Remove any stray trailing dash from CROSS_SUFFIX (as would result from clang -arch)
3 years ago
Martin Kroeker
9e0d126cc2
Update .cirrus.yml
3 years ago
Martin Kroeker
82124511fe
Update .cirrus.yml
3 years ago
Martin Kroeker
0724df404c
Merge branch 'xianyi:develop' into cirrusjobs
3 years ago
Chris Sidebottom
5b165420b5
SWITCH_RATIO for Arm(R) Neoverse(TM) architecture
This seems like a good balance of values for reasonably sized matrices. With `SWITCH_RATIO=16` the DGEMM scales better to bigger sizes but the better solution would be some kind of
thread throttling so I've gone with `SWITCH_RATIO=8`.
3 years ago
Chris Sidebottom
32f2fafde7
Propagate SWITCH_RATIO to DYNAMIC_ARCH builds
Previously dynamic builds were either using the default SWITCH_RATIO
or one from the higher level architecture; this patch ensures the
dynamic builds can use this parameter as well.
3 years ago
Martin Kroeker
a5e1fdd525
Merge pull request #4007 from Mousius/update-contributors
Add Chris Sidebottom to CONTRIBUTORS.md
3 years ago
Martin Kroeker
44164e3a3d
revert "move alpha out of register 18" (out of PR scope, no SVE on Apple hw)
3 years ago
Chris Sidebottom
bfc20c2e97
Add Chris Sidebottom to CONTRIBUTORS.md
3 years ago
Martin Kroeker
a44422f0d5
Merge pull request #3983 from thrasibule/makeflags
parallel build fixes
3 years ago
Martin Kroeker
73e6fcb925
Merge pull request #4006 from martin-frbg/issue4005
Fix ?GEMMT implementation
3 years ago
Martin Kroeker
38d7a7b562
Fix ?GEMMT
3 years ago
Martin Kroeker
8be68fa7f4
move declaration of sca to really keep the compiler from throwing it out (for now)
3 years ago
Martin Kroeker
4eac244c9a
Merge pull request #4004 from martin-frbg/ccheckif
fix missing blank in c_check
3 years ago
Martin Kroeker
970e611e00
fix missing blank in test
3 years ago
Martin Kroeker
f096a339e4
Use long value fields for cpu ident on OSX
3 years ago
Martin Kroeker
3727672a74
Improve workaround and keep compilers from optimizing it out
3 years ago
Martin Kroeker
108a21e47a
Move ALPHA out of register 18 (reserved on OSX)
3 years ago
Martin Kroeker
0b1acb0ba3
Move ALPHA_I out of register 18 (reserved on OSX)
3 years ago
Martin Kroeker
c7bbad09ad
Move ALPHA_I out of register 18 (reserved on OSX)
3 years ago
Martin Kroeker
cda29633a3
move ALPHA_I out of register 18 (reserved on OSX)
3 years ago
Martin Kroeker
6f759a9ce9
Merge pull request #4002 from imzhuhl/spr_detect
Fix x86 detection error
3 years ago
Honglin Zhu
ac650225c1
Fix x86 detection error
3 years ago
Martin Kroeker
58de28f332
Merge pull request #3999 from martin-frbg/issue3998
Convert CMAKE booleans to 0/1 values for gensymbol
3 years ago
Martin Kroeker
2ea00788c2
Add ?GEMMT
3 years ago