wjc404
7eecd8e39c
Add files via upload
6 years ago
wjc404
95fb98f556
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
4801c6d36b
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
9440fa607d
Add files via upload
6 years ago
wjc404
94db259e5b
Add files via upload
6 years ago
wjc404
f49f8047ac
Add files via upload
6 years ago
wjc404
825777faab
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
9c89757562
Add files via upload
6 years ago
wjc404
9b04baeaee
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
8a074b3965
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
211ab03b14
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
1733f927e6
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
182b06d6ad
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
7a9050d681
Update dgemm_kernel_4x8_haswell.S
6 years ago
wjc404
0ba29fd262
Update dgemm_kernel_4x8_haswell.S for zen2
replaced a bunch of vpermpd instructions with vpermilpd and vperm2f128
6 years ago
Martin Kroeker
bafa021ed6
Merge pull request #2181 from isuruf/install_name
Change install_name on osx to match linux
6 years ago
Isuru Fernando
b89d9762a2
Change install_name on osx to match linux
6 years ago
Martin Kroeker
08dedf4c5e
Merge pull request #2177 from martin-frbg/noaff
Fix surprising behaviour of NO_AFFINITY=0
6 years ago
Martin Kroeker
b89c781637
Fix surprising behaviour of NO_AFFINITY=0
6 years ago
Martin Kroeker
dd7ff77f4b
Merge pull request #2175 from martin-frbg/cmake-mingw-fixes
Fix CMAKE compilation with MinGW32 and add it to Appveyor
6 years ago
Martin Kroeker
8fb76134bc
Mingw32 needs leading underscore on object names
(also copy BUNDERSCORE settings for FORTRAN from the corresponding Makefile)
6 years ago
Martin Kroeker
04d671aae2
Make disabling DYNAMIC_ARCH on unsupported systems work
needs to be unset in the cache for the change to have any effect
6 years ago
Martin Kroeker
f69a0be712
Add getarch flags to disable AVX on x86
(and other small fixes to match Makefile behaviour)
6 years ago
Martin Kroeker
ae9e8b131e
Add mingw builds to Appveyor config
6 years ago
Martin Kroeker
9086543f50
Utest needs CBLAS but not necessarily FORTRAN
6 years ago
Martin Kroeker
abea977ded
Merge pull request #2162 from martin-frbg/pgi
Fixes for PGI compiler
6 years ago
Martin Kroeker
6b6c9b1441
Merge pull request #2172 from quickwritereader/develop
power9 cgemm/ctrmm. new sgemm 8x16
6 years ago
AbdelRauf
a97b301aaa
cgemm/ctrmm power9
6 years ago
Martin Kroeker
2f13f04224
Merge pull request #2170 from pkubaj/patch-1
Fix build on PPC970 for FreeBSD
6 years ago
pkubaj
7c7505a778
Fix build for PPC970 on FreeBSD pt.2
FreeBSD needs those macros too.
6 years ago
pkubaj
5a4f1a2118
Fix build for PPC970 on FreeBSD pt. 1
FreeBSD needs DCBT_ARG=0 as well.
6 years ago
Martin Kroeker
3b761892df
Merge pull request #2169 from pkubaj/develop
Fix build on FreeBSD/powerpc64.
6 years ago
Piotr Kubaj
eebfeba768
Fix build on FreeBSD/powerpc64.
Signed-off-by: Piotr Kubaj <pkubaj@anongoth.pl>
6 years ago
Martin Kroeker
7684c4f8f8
PGI compiler does not like -march=native
6 years ago
Martin Kroeker
7faf42b7bb
Merge pull request #2167 from kavanabhat/dtrmm_power8_segfault
Fix DTRMMKERNEL register save for power8 64-bit mode (Fix for #2166 )
6 years ago
kavanabhat
a575f1e4c7
Update dtrmm_kernel_16x4_power8.S
6 years ago
AbdelRauf
cdbfb891da
new sgemm 8x16
6 years ago
Martin Kroeker
280552b988
Fix mov syntax
6 years ago
Martin Kroeker
bbd4bb0154
Zero ecx with a mov instruction
PGI assembler does not like the initialization in the constraints.
6 years ago
Martin Kroeker
6d3efb2b58
Update Makefile.x86_64
6 years ago
Martin Kroeker
d9ff2cd90d
Do not force gcc options on non-gcc compilers
fixes compile failure with pgi 18.10 as reported on OpenBLAS-users
6 years ago
Martin Kroeker
2a43062de7
Merge pull request #2159 from martin-frbg/issue2149
Avoid unintentional activation of TLS codepath via USE_TLS=0
6 years ago
Martin Kroeker
4ea794a522
Avoid unintentional activation of TLS code via USE_TLS=0
fixes #2149
6 years ago
Martin Kroeker
ece0bfb881
Merge pull request #2158 from martin-frbg/issue2143
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
6 years ago
Martin Kroeker
1f4b6a5d5d
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
from #2143 , -march=native precludes use of more specific options like -march=skylake-avx512 in individual kernels, and defeats the purpose of dynamic arch anyway.
6 years ago
Martin Kroeker
be8f70d269
Merge pull request #2157 from martin-frbg/2154-2
Add gfortran workaround for potential ABI violation
6 years ago
Martin Kroeker
e674e1c735
Update fc.cmake
6 years ago
Martin Kroeker
6ca898b63b
Add gfortran workaround for potential ABI violation
for #2154
6 years ago
Martin Kroeker
26411acd56
Merge pull request #2148 from TiborGY/cpp_thread_test_2
Thread safety tester using C++11 threading (cleaned history)
6 years ago
Martin Kroeker
0ab4076dd8
Merge pull request #2156 from martin-frbg/issue2154
Add gfortran workaround for C->FORTRAN ABI violation
6 years ago