Martin Kroeker
e55ec82bb9
Delete KERNEL.1004K
5 years ago
Martin Kroeker
7353ea5afc
Delete KERNEL.24K
5 years ago
Martin Kroeker
6a04efb122
Rename KERNEL files to include MIPS prefix
5 years ago
Martin Kroeker
5afb66812f
Update getarch.c
5 years ago
Martin Kroeker
0d18f231fc
Update getarch.c
5 years ago
Martin Kroeker
2f4a8e5bc4
Rename the FORCE entries for 24K and 1004K to include the MIPS prefix
5 years ago
Martin Kroeker
8792fc4d5f
Disable RPCC macro on MIPS24K
5 years ago
Martin Kroeker
577c5d9f8f
Update README.md
5 years ago
Martin Kroeker
6721f2750e
Update TargetList.txt
5 years ago
Martin Kroeker
b0b02a080d
Add compiler options for MIPS32 24K/1004K
5 years ago
Martin Kroeker
a1fc98dc57
rename 1004K, 24K to MIPS1004K, MIPS24K to avoid identifier naming problem
5 years ago
Martin Kroeker
00172d440b
Typo fix in MIPS24K addition
5 years ago
Martin Kroeker
d712ea724c
Add MIPS24K support
5 years ago
Martin Kroeker
61bbae3ac1
Handle MIPS24K like P5600
and allow enforcing TARGET=1004K as well (omission from earlier 1004K merge and later introduction of TARGET check)
5 years ago
Martin Kroeker
1c1ca2bc0a
Merge pull request #47 from xianyi/develop
rebase
5 years ago
Martin Kroeker
236a3d8ce6
Merge pull request #2563 from zelong-1024/develop
[OpenBLAS]: benchmark error of potrf
5 years ago
l00536773
6b7ef6543a
[OpenBLAS]: benchmark error of potrf
[description]: when the matrix size goes higher than 5800 during the cpotrf test, error info, such as "Potrf info = 5679", will be returned on ARM64 and x86 machines. Uplo = L & F.
[solution]: changed the func for building the matrix so that the complex Hermitian matrix can stay positive definite during the computation.
[dts]:
5 years ago
Martin Kroeker
250e6f8039
Merge pull request #2557 from martin-frbg/dronebadge
Update and reformat README
5 years ago
Martin Kroeker
7a6d0016b0
Merge pull request #2556 from martin-frbg/epicdrone
Add a drone.io multithread test for x86_64
5 years ago
Martin Kroeker
e8e8a6e608
Restore USE_OPENMP in the x86 thread test
5 years ago
Martin Kroeker
579811fb6a
Move all 19.04-based jobs back to ubuntu 18.04
5 years ago
Martin Kroeker
84a9614345
try x86_64 test without openmp
5 years ago
Martin Kroeker
b969533703
Add drone.io badge, mention EMAG8180 support, reformat the DYNAMIC_ARCH paragraph
5 years ago
Martin Kroeker
0f08f3efa6
Add a multithread test for x86_64
5 years ago
Martin Kroeker
c861b2a7bd
Merge pull request #2553 from martin-frbg/issue2444
Add a read memory barrier to the traversal of the buffer slot list
5 years ago
Martin Kroeker
cf62adffbb
Merge pull request #2555 from martin-frbg/issue1137
Handle unaligned data in the SSE2 copy kernel
5 years ago
Martin Kroeker
3eec7d382c
ARMV7 does not support DMB ISHLD, use DMB ISH
5 years ago
Martin Kroeker
5b0093b5fe
Convert aligned moves to unaligned
should have no performance impact on reasonably modern cpus and fixes occasional crashes in actual user code.
5 years ago
Martin Kroeker
f41600e66f
Add a read barrier in the traversing of the buffer list
Needed on systems with weak memory ordering - the inferior, partially working fix from #2544 was already removed in #2551
5 years ago
Martin Kroeker
f5efecb7ca
Add (empty) read barrier definition
5 years ago
Martin Kroeker
a52bdd9d7b
Add (empty) read barrier definition
5 years ago
Martin Kroeker
db3226a646
Add (empty) read barrier definition
5 years ago
Martin Kroeker
69b6e258d8
Add (empty) read barrier definition
5 years ago
Martin Kroeker
3d4db4d002
Add read barrier definition
5 years ago
Martin Kroeker
99dde1d2c9
Add read barrier definition
5 years ago
Martin Kroeker
ee6b3df02c
Add read barrier definition
5 years ago
Martin Kroeker
25e879fe92
Add (empty) read barrier definition
5 years ago
Martin Kroeker
d237dc1360
Add read barrier definition
5 years ago
Martin Kroeker
8692456226
Add read barrier definition
5 years ago
Martin Kroeker
d1d69e1b9a
Add read barrier definition
5 years ago
Martin Kroeker
20d0cb2f65
Merge pull request #46 from xianyi/develop
rebase
5 years ago
Martin Kroeker
e7f0da9295
Merge pull request #2551 from martin-frbg/issue2538-2
Increase BUFFER_SIZEs and add a safeguard; supply GEMM_R for POWER8/9
5 years ago
Martin Kroeker
e9bfa2291a
Fix parameter overflow
5 years ago
Martin Kroeker
2a28448a96
Add safeguards for sufficient BUFFER_SIZE
5 years ago
Martin Kroeker
a33d177430
Increase default BUFFER_SIZE on ARM, ZARCH and newer x86_64, add GEMM_R for POWER8/9
As shown in #2538 , default buffersizes on some platforms were smaller than required in memory.c
and the requirement could never be fulfilled for a calculated GEMM_R on PPC given the fomula used
5 years ago
Martin Kroeker
f73391c9c9
Merge pull request #45 from xianyi/develop
rebase
5 years ago
Martin Kroeker
7905383cb5
Merge pull request #2547 from sharvil/develop
Add API to set thread affinity on Linux.
5 years ago
Martin Kroeker
a8cbd451bf
Merge pull request #2541 from bapt/develop
libname: treat FreeBSD and DragonFly like linux and sunos
5 years ago
Martin Kroeker
eecd8c3204
Merge pull request #2548 from gxw-loongson/develop
Add a GENERIC target for 64bit MIPS
5 years ago
Martin Kroeker
ea85eb2e02
Merge pull request #2549 from martin-frbg/fixthreadtest
Match thread count in cpp_thread_test to host capability
5 years ago