Martin Kroeker
4fc17d0d75
Trivial typo fix
as suggested in #2022
7 years ago
Martin Kroeker
03d7110900
Merge pull request #2042 from maomao194313/develop
add TARGET support for HiSilicon tsv110 CPUs
7 years ago
Martin Kroeker
3ce28fb81a
Merge pull request #2055 from martin-frbg/atomid
Add CPUID data for Intel Denverton (as Nehalem)
7 years ago
Martin Kroeker
04f2226ea6
Add Intel Denverton
7 years ago
Martin Kroeker
b1393c7a97
Add Intel Denverton
for #2048
7 years ago
maomao194313
7e3eb9b25d
make DYNAMIC_ARCH=1 package work on TSV110
7 years ago
maomao194313
f074d7d146
make DYNAMIC_ARCH=1 package work on TSV110.
7 years ago
Martin Kroeker
f18ab6c17b
Merge pull request #2051 from martin-frbg/issue2048
Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1
7 years ago
Martin Kroeker
946ec6c3b8
Merge pull request #2050 from kencu/PowerMacFix
PowerMac 970 fixes
7 years ago
Martin Kroeker
5b95534afc
Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1
for issue #2048
7 years ago
ken-cunningham-webuse
f7a06463d9
common_power.h: force DCBT_ARG 0 on PPC970 Darwin
without this, we see
../kernel/power/gemv_n.S:427:Parameter syntax error
and many more similar entries
that relates to this assembly command
dcbt 8, r24, r18
this change makes the DCBT_ARG = 0
and openblas builds through to completion on PowerMac 970
Tests pass
7 years ago
ken-cunningham-webuse
b0c714ef60
param.h : enable defines for PPC970 on DarwinOS
fixes:
gemm.c: In function 'sgemm_':
../common_param.h:981:18: error: 'SGEMM_DEFAULT_P' undeclared (first use in this function)
#define SGEMM_P SGEMM_DEFAULT_P
^
7 years ago
Martin Kroeker
8d3d29e4d7
Merge pull request #2049 from Celelibi/fix_crash_sgemm_sse_x64
Fix crash in sgemm SSE/nano kernel on x86_64
7 years ago
Celelibi
b7f59da42d
Fix crash in sgemm SSE/nano kernel on x86_64
Fix bug #2047.
Signed-off-by: Celelibi <celelibi@gmail.com>
7 years ago
Martin Kroeker
db3dc9e282
Merge pull request #2046 from kencu/powermac
ctest.c : add __POWERPC__ for PowerMac
7 years ago
ken-cunningham-webuse
4290afdae2
ctest.c : add __POWERPC__ for PowerMac
7 years ago
Martin Kroeker
4741ce803b
Merge pull request #2045 from martin-frbg/2033-3
Do not compile in AVX512 check if AVX support is disabled
7 years ago
Martin Kroeker
11cfd0bd75
Do not compile in AVX512 check if AVX support is disabled
The xgetbv function depends on NO_AVX being undefined. We could change that too, but that combination is unlikely to work anyway.
7 years ago
Martin Kroeker
651ab01d2b
Merge pull request #2044 from martin-frbg/issue2043
Fix module definition conflicts between LAPACK and ReLAPACK
7 years ago
Martin Kroeker
d7b2c53c0b
Merge pull request #2039 from brada4/meminit
Address warning in memory.c
7 years ago
Martin Kroeker
e4864a8933
Fix module definition conflicts between LAPACK and ReLAPACK
for #2043
7 years ago
Martin Kroeker
10d841d8b9
Merge pull request #2026 from martin-frbg/trmv_threads
Correct range limiting in trmv_thread and re-enable TRMV multithreading
7 years ago
Martin Kroeker
12f2b76748
Merge pull request #2038 from martin-frbg/issue2035
Improve handling of NO_STATIC and NO_SHARED
7 years ago
Martin Kroeker
6c83b878f6
Merge pull request #2040 from martin-frbg/locks2002
Restore locking optimizations for OpenMP case
7 years ago
maomao194313
fb4dae7124
add TARGET support for HiSilicon tsv110 CPUs
7 years ago
maomao194313
760842dda1
add TARGET support for HiSilicon tsv110 CPUs
7 years ago
maomao194313
53f482ee72
add TARGET support for HiSilicon tsv110 CPUs
7 years ago
maomao194313
783ba8058f
HiSilicon tsv110 CPUs optimization branch
add HiSilicon tsv110 CPUs optimization branch
7 years ago
Martin Kroeker
af480b02a4
Restore locking optimizations for OpenMP case
restore another accidentally dropped part of #1468 that was missed in #2004 to address performance regression reported in #1461
7 years ago
Andrew
e4a79be6bb
Address warning introduced with #1814 et al.
7 years ago
Andrew
e5c316c6b9
init
7 years ago
Martin Kroeker
25427926bc
Improve handling of NO_STATIC and NO_SHARED
to avoid surprises from defining either as zero. Fixes #2035 by addressing some concerns from #1422
7 years ago
Martin Kroeker
edb8143141
Merge pull request #2037 from martin-frbg/issue2033-2
Make sure that AVX512 is disabled in 32bit builds
7 years ago
Martin Kroeker
c4868d11c0
Make sure that AVX512 is disabled in 32bit builds
for #2033
7 years ago
Martin Kroeker
4c321ae571
Merge pull request #2034 from martin-frbg/issue2033
Make x86_32 imply NO_AVX2, NO_AVX512 in addition to NO_AVX
7 years ago
Martin Kroeker
2ffb727187
Keep xcode8.3 for osx BINARY=32 build
as xcode10 deprecated i386
7 years ago
Martin Kroeker
d66214c946
Make x86_32 imply NO_AVX2, NO_AVX512 in addition to NO_AVX
fixes #2033
7 years ago
Martin Kroeker
fd34820b99
Fix AVX512 test always returning false due to missing compiler option
7 years ago
Martin Kroeker
918a0cc4d1
Fix missing -c option in AVX512 test
7 years ago
Martin Kroeker
0db9c03e7e
Merge pull request #2028 from brada4/mv
Move one of the clobber fixes to the right place
7 years ago
Andrew
6eee1beac5
move fix to right place
7 years ago
Andrew
e5df5958cc
init
7 years ago
Martin Kroeker
343b301d14
Reduce list of kernels in the dynamic arch build
to make compilation complete reliably within the 1h limit again
7 years ago
Martin Kroeker
45333d5793
Fix error introduced during cleanup
7 years ago
Martin Kroeker
e29b0cfcc4
Allow multithreading TRMV again
Revert the workaround introduced for issue #1332, as the actual cause appears to be my incorrect fix from #1262 (see #1388)
7 years ago
Martin Kroeker
78d9910236
Correct range_n limiting
Same bug as seen in #1388, somehow missed in the corresponding PR #1389
7 years ago
Martin Kroeker
e12cdf58ef
Merge pull request #2024 from martin-frbg/gcc9fixes4
Fix inline assembly constraints in Bulldozer TRSM kernels
7 years ago
Martin Kroeker
1860c9456d
Merge pull request #2023 from martin-frbg/gcc9fixes3
Fix inline assembly constraints in various x86_64 GEMVN kernels
7 years ago
Martin Kroeker
aec905498f
Merge pull request #1988 from TiborGY/patch-1
Reword/expand comments in Makefile.rule
7 years ago
TiborGY
56089991e2
fix the the
7 years ago