Martin Kroeker
bb8c3f6861
Add ld/binutils version check for POWER10 support
5 years ago
Martin Kroeker
ff65952e46
Move HAVE_P10_SUPPORT to the build system
to be able to include a binutils version check
5 years ago
Martin Kroeker
6208c9899e
Merge pull request #104 from xianyi/develop
rebase
5 years ago
Martin Kroeker
8e20ab21c8
Merge pull request #2924 from martin-frbg/issue2920
Put back all symbols accidentally dropped in the reorganization of gensymbol
5 years ago
Martin Kroeker
dc6e44c3f8
Merge pull request #2916 from martin-frbg/issue2911
Clean up duplicate definitions in POWER8 kernels and fix power10 option passing
5 years ago
Martin Kroeker
4ad33c46b0
Add back symbols that got dropped when splitting by type
5 years ago
Martin Kroeker
fe2a922ada
Add POWER10 compiler options to CCOMMON_OPT rather than COMMON_OPT
5 years ago
Martin Kroeker
9cac379655
Merge pull request #103 from xianyi/develop
rebase
5 years ago
Martin Kroeker
a61c086408
Fix spurious trailing whitespace in comment
5 years ago
Martin Kroeker
5b9ebe4f8a
Merge pull request #2919 from isuruf/export
Fix exporting some lapack and cblas symbols
5 years ago
Martin Kroeker
7eddaf0d6f
Remove -mmma again (reduntant with cpu=power10) and add override statements
5 years ago
Isuru Fernando
14b1d33933
Fix exporting some lapack and cblas
5 years ago
Martin Kroeker
77669b019d
Merge pull request #2915 from bartoldeman/no-empty_sgemm_direct_skylakex
sgemm_direct_skylakex: fix 75eeb26 regression.
5 years ago
Martin Kroeker
5e8ddc9001
Merge pull request #2913 from martin-frbg/issue2910
Support cross-compiling for Apple Vortex
5 years ago
Bart Oldeman
03e781b766
sgemm_direct_skylakex: fix 75eeb26 regression.
The
`#if defined(SKYLAKEX) || defined (COOPERLAKE)`
from that commit was before #include "common.h" so caused the
compiled function to be empty, returning garbage results for
qualifying sgemm's on those architectures.
Closes #2914
5 years ago
Martin Kroeker
f1a4071d8c
Clean up STACKSIZE redefinition
5 years ago
Martin Kroeker
97cf10062f
Clean up STACKSIZE redefinition
5 years ago
Martin Kroeker
17e288e18d
Clean up STACKSIZE redefinition
5 years ago
Martin Kroeker
c1422f3e46
Clean up STACKSIZE redefinition
5 years ago
Martin Kroeker
d85b24e103
Clean up STACKSIZE redefinition
5 years ago
Martin Kroeker
7d6c85f9da
Add compiler option -mmma for POWER10
5 years ago
Martin Kroeker
2e7ee7c716
Fix naming of L2 cache size item reported for Vortex
5 years ago
Martin Kroeker
efd47b0104
Merge pull request #2909 from isuruf/patch-1
Need a space when redirecting to file
5 years ago
Martin Kroeker
f5902ab0a1
Support cross-compiling for Apple Vortex
5 years ago
Martin Kroeker
1a0c185122
Support cross-compiling for Apple Vortex
5 years ago
Martin Kroeker
89eea6b455
Merge pull request #102 from xianyi/develop
rebase
5 years ago
Isuru Fernando
a5c667b55c
Need a space when redirecting to file
Following two commands have two completely different meanings
perl ./gensymbol objcopy x86_64 _ 0 0 0 0 0 0 "" "64_" 1 0 1 1 1 1 > objcopy.def
perl ./gensymbol objcopy x86_64 _ 0 0 0 0 0 0 "" "64_" 1 0 1 1 1 1> objcopy.def
5 years ago
Martin Kroeker
0ac6102708
Update version string to 0.3.11.dev
5 years ago
Martin Kroeker
26a701f4ad
Update version string to 0.3.11.dev
5 years ago
Martin Kroeker
fcd0fa1a3a
Merge pull request #2908 from xianyi/release-0.3.0
Synchronyse tag with release 0.3.11
5 years ago
Martin Kroeker
51c22612eb
Merge pull request #2907 from xianyi/develop
Update from develop for 0.3.11
5 years ago
Martin Kroeker
b8f689200e
Update version number to 0.3.11
5 years ago
Martin Kroeker
fe9015b619
Update version for 0.3.11 release
5 years ago
Martin Kroeker
f99b8c1502
Merge pull request #2906 from martin-frbg/changelog-0311
Update Changelog.txt with the 0.3.11 changes
5 years ago
Martin Kroeker
5381a18056
Update Changelog.txt with the 0.3.11 changes
5 years ago
Martin Kroeker
e35576c6fc
Merge pull request #2905 from martin-frbg/aocc-clang
Add -mavx for clang & aocc
5 years ago
Martin Kroeker
f1bb85d378
Add AVX flags for clang/aocc as well
5 years ago
Martin Kroeker
25907e672b
Merge pull request #101 from xianyi/develop
rebase
5 years ago
Martin Kroeker
9789375389
Merge pull request #2900 from martin-frbg/fixcmake_sse
Add compiler options for SSE to the cmake support files
5 years ago
Martin Kroeker
f64243ff57
Add compiler options for sse/sse2/ssse3/sse4.1
5 years ago
Martin Kroeker
786c0a3ce8
Add sse options for use of intrinics with older compilers
5 years ago
Martin Kroeker
df70667043
fix core list for sse/sse2
5 years ago
Martin Kroeker
e6c5b13a18
Merge pull request #2898 from martin-frbg/morefixes
More pre-release fixes
5 years ago
Martin Kroeker
f071d1207a
add sse2
5 years ago
Martin Kroeker
dc6cefd2f5
Expressly enable -msse for 32bit DYNAMIC_ARCH kernels
5 years ago
Martin Kroeker
c339c40c01
Silence a redefinition warning
5 years ago
Martin Kroeker
ac8af9cec6
Add -msse where supported, apparently required for older gcc
5 years ago
Martin Kroeker
10379fc83b
Use ifdef instead of if
5 years ago
Martin Kroeker
a85ac71633
Merge pull request #100 from xianyi/develop
rebase
5 years ago
Martin Kroeker
4c25910da0
Merge pull request #2896 from martin-frbg/intrin-double
Add compiler flag for SSE4 where available
5 years ago