Martin Kroeker
2bbb31e78f
Update mips64.yml
1 year ago
Martin Kroeker
145a975a8a
add libglib2/gthread2 for qemu
1 year ago
Martin Kroeker
e0285c0174
add glib2.0 for qemu
1 year ago
Martin Kroeker
ef49e4113e
fix ubuntu at 22 for compatibility and update run action to v3
1 year ago
Martin Kroeker
01ab072c2b
Ubuntu 24 now comes with libtinfo6 rather than 5
1 year ago
Martin Kroeker
610a28ceca
ubuntu 24 now comes with libtinfo6 rather than 5
1 year ago
Martin Kroeker
a7483d181b
Merge pull request #5074 from tingboliao/develop
Optimize the gemm_tcopy_8_rvv to be compatible with the vlens 128 and 256.
1 year ago
tingbo.liao
ef7f54b357
Optimized the gemm_tcopy_8_rvv to be compatible with the vlens 128 and 256.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
1 year ago
Martin Kroeker
eba7338484
Merge pull request #5073 from XiWeiGu/la64_update_symv_lsx_version
LoongArch64: Update symv lsx version
1 year ago
gxw
e0a8216554
LoongArch64: Update dsymv LSX version
1 year ago
gxw
a9070ba3f9
LoongArch64: Update ssymv LSX version
1 year ago
Martin Kroeker
9b981035db
Merge pull request #5070 from xry111/xry111/lasx-la664
LoongArch64: Fix dsymv and ssymv LASX version
1 year ago
Martin Kroeker
fee353e63d
Merge pull request #5072 from martin-frbg/azureosx13
Azure CI: update deprecated macos-12 jobs to macos-13 image
1 year ago
Martin Kroeker
0c0112dfef
update deprecated macos-12 jobs to macos-13 image
1 year ago
Xi Ruoyao
af10c132b8
LoongArch64: Fix dsymv and ssymv LASX version
"fmov.d $f2, $f4" leaves all bits above bit 63 unpredictable, but the
code that follows clearly uses the value of those high bits. We actually
want to replicate the lower 64 bits here, so we should use xvreplve0.d
instead.
LA464 (Loongson 3[A-Z]-5000) happens to replicate them for us due to
microarchitectural internals, so the issue went undetected, but on LA664
(Loongson 3[A-Z]-6000) and future microarchitectures we need to do this
correctly or we end up with a lot of test failures.
Closes: https://bbs.aosc.io/t/topic/302
Signed-off-by: Xi Ruoyao <xry111@xry111.site>
1 year ago
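Illustrative note on af10c132b8: the difference between the two instructions can be sketched with a toy model. This is not the kernel code and not an emulator. It assumes a 256-bit LASX register can be pictured as four 64-bit lanes, and the helper names `fmov_d` and `xvreplve0_d` are hypothetical Python stand-ins for the instructions named in the commit message. The "unpredictable" upper bits are modelled with random values purely for illustration.

```python
import random

LANES = 4  # toy model: a 256-bit register as four 64-bit lanes


def fmov_d(src, rng):
    """Toy model of a scalar 64-bit move.

    Only lane 0 is defined afterwards. The upper lanes are filled with
    arbitrary values to stand in for "unpredictable" bits (an assumption
    of this sketch; real hardware may leave anything there).
    """
    return [src[0]] + [rng.getrandbits(64) for _ in range(LANES - 1)]


def xvreplve0_d(src):
    """Toy model of replicating lane 0 into every 64-bit lane."""
    return [src[0]] * LANES


rng = random.Random(0)
# 1.5 as an IEEE-754 double bit pattern in lane 0, zeros elsewhere
alpha = [0x3FF8000000000000, 0, 0, 0]

moved = fmov_d(alpha, rng)        # lane 0 is reliable, lanes 1..3 are not
replicated = xvreplve0_d(alpha)   # every lane holds the lane-0 value
```

In this model, vector code that reads all four lanes gets a consistent value only from `replicated`. This matches the commit's reasoning that the scalar move appeared to work on LA464 only because that core happened to replicate the low 64 bits.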
Martin Kroeker
4e817f804c
Update version to 0.3.29.dev
1 year ago
Martin Kroeker
8a316e68a5
Update version to 0.3.29.dev
1 year ago
Martin Kroeker
07756abb3e
Merge pull request #5067 from OpenMathLib/release-0.3.0
merge release 0.3.29 back into develop to copy tag
1 year ago
Martin Kroeker
8795fc7985
set version to 0.3.29
1 year ago
Martin Kroeker
e0c134e1f6
set version to 0.3.29
1 year ago
Martin Kroeker
9207052d85
Merge pull request #5066 from OpenMathLib/develop
Merge changes from develop in preparation of the 0.3.29 release
1 year ago
Martin Kroeker
7f5b703a80
Merge pull request #5065 from martin-frbg/changelog0329
Update the Changelog for version 0.3.29
1 year ago
Martin Kroeker
20f6114e98
add descriptions of build/runtime vars to 0.3.29 improvements
1 year ago
Martin Kroeker
f422845b6d
Merge pull request #5064 from martin-frbg/lapack1080
Replace LAPACK ?LARFT with a recursive implementation (Reference-LAPACK PR 1080)
1 year ago
Martin Kroeker
ce66ffe7bb
Update the Changelog for version 0.3.29
1 year ago
Martin Kroeker
d035e80d33
move the original non-recursive ?LARFT here (Reference-LAPACK PR 1080)
1 year ago
Martin Kroeker
459fa8102b
Create subdirectory for the old non-recursive ?larft
1 year ago
Martin Kroeker
0c4b4cd78c
move the non-recursive original ?larft here (Reference-LAPACK PR 1080)
1 year ago
Martin Kroeker
ed516994d6
replace ?larft with a recursive implementation (Reference-LAPACK PR 1080)
1 year ago
Martin Kroeker
5527eda561
Merge pull request #5063 from martin-frbg/lapack1062
Remove comparison that is always false (Reference-LAPACK PR 1062)
1 year ago
Martin Kroeker
4c1a23673a
Remove comparison that is always false (Reference-LAPACK PR 1062)
1 year ago
Martin Kroeker
d74eb02954
Merge pull request #5057 from martin-frbg/issue5050
Replace while loop in generic C/ZGEMM_BETA to avoid going out of bounds
1 year ago
Martin Kroeker
30f7a4120b
Merge pull request #5056 from tingboliao/dev_omatcopy_20250108
Optimize the omatcopy_cn/zomatcopy_cn kernels with RVV 1.0 intrinsic.
1 year ago
Martin Kroeker
0b9de3ef7d
Merge pull request #5042 from tingboliao/develop
Add the test cases of rot to improve the unit tests for rot_rvv.
1 year ago
Martin Kroeker
c31f148c76
Merge pull request #5061 from XiWeiGu/la64_update_symv
LoongArch64: Update symv
1 year ago
gxw
20a8e48f25
LoongArch64: Update ssymv LASX version
1 year ago
gxw
e0748588b8
LoongArch64: Update dsymv LASX version
1 year ago
Martin Kroeker
d91d4fa6e9
convert the beta=0 branch to a for loop as well
1 year ago
Martin Kroeker
8cc32f5461
Merge branch 'OpenMathLib:develop' into issue5050
1 year ago
Martin Kroeker
a588ea90fd
Merge pull request #5052 from martin-frbg/issue5051
Do not check LDVT in LAPACKE_?gesvd_work if VT is not used anyway
1 year ago
Martin Kroeker
21013fdb66
Merge pull request #5058 from martin-frbg/cirrusbsd3
Cirrus CI: Update FreeBSD jobs to 14.1
1 year ago
Martin Kroeker
b67a963412
gcc remains at 13 even for freebsd-14.1
1 year ago
Martin Kroeker
a9eec233d3
Update FreeBSD jobs to 14.1
1 year ago
Martin Kroeker
09e75f1588
fix absurd typo
1 year ago
Martin Kroeker
2891fd8d6d
Replace while loop with for
1 year ago
tingbo.liao
0a5dbf13d3
Optimize the omatcopy_cn and zomatcopy_cn kernels with RVV 1.0 intrinsic.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
1 year ago
Martin Kroeker
69d3541488
Merge pull request #5055 from fleclairTT/develop
remove fastmath from x280 target
1 year ago
Felix LeClair
d0350a08b7
Merge pull request #1 from fleclairTT/fleclairTT-remove-fast-math
Update Makefile.riscv64
1 year ago
Felix LeClair
05dce05c24
Update Makefile.riscv64
remove fast-math
1 year ago
Martin Kroeker
8385e02ae1
Do not check LDVT when VT is not going to be referenced
1 year ago