Martin Kroeker
180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107
Further rearranged the rotm kernel for the different architectures.
1 year ago
tingbo.liao
3c8df6358f
Further rearranged the rotm kernel for the different architectures.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
1 year ago
gxw
e0a8216554
LoongArch64: Update dsymv LSX version
1 year ago
gxw
a9070ba3f9
LoongArch64: Update ssymv LSX version
1 year ago
Xi Ruoyao
af10c132b8
LoongArch64: Fix dsymv and ssymv LASX version
"fmov.d $f2, $f4" leaves all the bits higher than the 63-th bit
unpredictable but it's obvious that the following code uses the value of
those high bits. We actually want to replicate the lower 64 bits here,
so we should use xvreplve0.d instead.
LA464 (Loongson 3[A-Z]-5000) happens to replicate them for us due to
some uarch internal details so the issue was not detected, but for LA664
(Loongson 3[A-Z]-6000) and future uarch we need to do things correctly
or we end up getting a lot of test failures.
Closes: https://bbs.aosc.io/t/topic/302
Signed-off-by: Xi Ruoyao <xry111@xry111.site>
1 year ago
gxw
20a8e48f25
LoongArch64: Update ssymv LASX version
1 year ago
gxw
e0748588b8
LoongArch64: Update dsymv LASX version
1 year ago
gxw
bb31bbef52
LoongArch64: Opt somatcopy_ct with LASX
1 year ago
gxw
b37129341b
LoongArch64: Opt somatcopy_cn with LASX
1 year ago
gxw
acf6cab304
LoongArch64: Opt somatcopy_rn with LASX
1 year ago
gxw
15edb441bf
LoongArch64: Opt somatcopy_rt with LASX
1 year ago
Martin Kroeker
9783dd07ab
Rename KERNEL.LOONGSONGENERIC to KERNEL.LA64_GENERIC
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
gxw
48698b2b1d
LoongArch64: Rename core
Use microarchitecture name instead of meaningless strings to name the core,
the legacy core is still retained.
1. Rename LOONGSONGENERIC to LA64_GENERIC
2. Rename LOONGSON3R5 to LA464
3. Rename LOONGSON2K1000 to LA264
1 year ago
Martin Kroeker
e05d98d00a
expressly use fld.d/fst.d for floating point registers instead of LD/ST macros
1 year ago
gxw
3f39c8f94f
LoongArch: Fixed numpy CI failure
1 year ago
gxw
af73ae6208
LoongArch: Fixed issue 4728
2 years ago
gxw
8ab2e9ec65
LoongArch: DGEMM small matrix opt
2 years ago
Martin Kroeker
8da6f7e5f2
Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6
Loongarch64: Improving the Performance and Stability of dgemm
2 years ago
gxw
f9a26240a7
loongarch64: Fixed icamax_lsx
2 years ago
gxw
cb0f707409
loongarch64: Fixed utest fork:safety
2 years ago
Martin Kroeker
b45d8e1ab2
remove stray comma
2 years ago
gxw
6017ad7146
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
2 years ago
Martin Kroeker
992b71fea2
remove stray comma
2 years ago
gxw
7cd438a5ac
loongarch64: Fixed clang compilation issues
2 years ago
gxw
96607cbb98
loongarch: Fixed dzamax
Initialize the registers to prevent sporadic errors.
2 years ago
gxw
50869f6ca8
loongarch: Fixed zrot LSX opt
2 years ago
gxw
b5eb9d6bac
loongarch: Fixed {sc/dz}amax LSX opt
2 years ago
gxw
ad13e04669
loongarch: Fixed {s/d/sc/dz}amin LSX opt
2 years ago
gxw
bbf82cb624
loongarch: Fixed {s/d}axpby LSX opt
2 years ago
gxw
ac460eb42a
loongarch: Fixed i{c/z}amin LSX opt
2 years ago
gxw
60e251a1f8
loongarch: Fixed {sc/dz}amax LASX opt
2 years ago
gxw
a10dde5554
loongarch: Fixed {s/d/sc/dz}amin LASX opt
2 years ago
gxw
6534d378b7
loongarch: Fixed {s/d/c/z}sum LASX opt
2 years ago
gxw
6159cffc58
loongarch: Fixed i{s/c/z}amin LASX opt
2 years ago
gxw
7d755912b9
loongarch: Fixed {s/d/c/z}axpby LASX opt
2 years ago
pengxu
680a77fafc
Optimized ssymv and dsymv kernel LSX for LoongArch
2 years ago
pengxu
6546600342
Optimized ssymv and dsymv kernel LASX for LoongArch
2 years ago
Martin Kroeker
577d480c62
Merge pull request #4529 from ErnstPeng/feature-branch
Optimized sgemv and dgemv kernel LSX for LoongArch
2 years ago
pengxu
b2db064285
Optimized sgemv and dgemv kernel LSX for LoongArch
2 years ago
gxw
8e05c053be
LoongArch64:Fixed the failed test cases test_{c/z}gemv_n in test_extensions
2 years ago
gxw
3f22fc2233
LoongArch64: Add zgemv LSX opt
2 years ago
gxw
c508a10cf2
LoongArch64: Add cgemv LSX opt
2 years ago
gxw
8dea25ffff
LoongArch64: Fixed utest kernel_regress:skx_avx
2 years ago
gxw
990507e3b8
LoongArch64: Opt zgemv with LASX
2 years ago
gxw
d51ffec3a2
LoongArch64: Opt cgemv with LASX
2 years ago
pengxu
4787a55c64
Optimized cgemm kernel 16x4 LASX for LoongArch
2 years ago
pengxu
fe3da43b7d
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
2 years ago
Martin Kroeker
b537528feb
Merge pull request #4480 from XiWeiGu/loongarch64-fixed-{s/d}amin-lsx
LoongArch64: Fixed {s/d}amin LSX optimization
2 years ago