gxw
bb31bbef52
LoongArch64: Opt somatcopy_ct with LASX
1 year ago
gxw
b37129341b
LoongArch64: Opt somatcopy_cn with LASX
1 year ago
gxw
acf6cab304
LoongArch64: Opt somatcopy_rn with LASX
1 year ago
gxw
15edb441bf
LoongArch64: Opt somatcopy_rt with LASX
1 year ago
Martin Kroeker
9783dd07ab
Rename KERNEL.LOONGSONGENERIC to KERNEL.LA64_GENERIC
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
gxw
48698b2b1d
LoongArch64: Rename core
Use microarchitecture names instead of meaningless strings to name the cores;
the legacy core names are still retained.
1. Rename LOONGSONGENERIC to LA64_GENERIC
2. Rename LOONGSON3R5 to LA464
3. Rename LOONGSON2K1000 to LA264
1 year ago
Martin Kroeker
e05d98d00a
expressly use fld.d/fst.d for floating point registers instead of LD/ST macros
1 year ago
gxw
3f39c8f94f
LoongArch: Fixed numpy CI failure
1 year ago
gxw
af73ae6208
LoongArch: Fixed issue 4728
1 year ago
gxw
8ab2e9ec65
LoongArch: DGEMM small matrix opt
2 years ago
Martin Kroeker
8da6f7e5f2
Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6
Loongarch64: Improving the Performance and Stability of dgemm
1 year ago
gxw
f9a26240a7
loongarch64: Fixed icamax_lsx
1 year ago
gxw
cb0f707409
loongarch64: Fixed utest fork:safety
1 year ago
Martin Kroeker
b45d8e1ab2
remove stray comma
1 year ago
gxw
6017ad7146
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
1 year ago
Martin Kroeker
992b71fea2
remove stray comma
1 year ago
gxw
7cd438a5ac
loongarch64: Fixed clang compilation issues
1 year ago
gxw
96607cbb98
loongarch: Fixed dzamax
Initialize the registers to prevent sporadic errors.
1 year ago
gxw
50869f6ca8
loongarch: Fixed zrot LSX opt
1 year ago
gxw
b5eb9d6bac
loongarch: Fixed {sc/dz}amax LSX opt
1 year ago
gxw
ad13e04669
loongarch: Fixed {s/d/sc/dz}amin LSX opt
1 year ago
gxw
bbf82cb624
loongarch: Fixed {s/d}axpby LSX opt
1 year ago
gxw
ac460eb42a
loongarch: Fixed i{c/z}amin LSX opt
1 year ago
gxw
60e251a1f8
loongarch: Fixed {sc/dz}amax LASX opt
1 year ago
gxw
a10dde5554
loongarch: Fixed {s/d/sc/dz}amin LASX opt
1 year ago
gxw
6534d378b7
loongarch: Fixed {s/d/c/z}sum LASX opt
1 year ago
gxw
6159cffc58
loongarch: Fixed i{s/c/z}amin LASX opt
1 year ago
gxw
7d755912b9
loongarch: Fixed {s/d/c/z}axpby LASX opt
1 year ago
pengxu
680a77fafc
Optimized ssymv and dsymv kernel LSX for LoongArch
1 year ago
pengxu
6546600342
Optimized ssymv and dsymv kernel LASX for LoongArch
1 year ago
Martin Kroeker
577d480c62
Merge pull request #4529 from ErnstPeng/feature-branch
Optimized sgemv and dgemv kernel LSX for LoongArch
1 year ago
pengxu
b2db064285
Optimized sgemv and dgemv kernel LSX for LoongArch
1 year ago
gxw
8e05c053be
LoongArch64: Fixed the failed test cases test_{c/z}gemv_n in test_extensions
1 year ago
gxw
3f22fc2233
LoongArch64: Add zgemv LSX opt
1 year ago
gxw
c508a10cf2
LoongArch64: Add cgemv LSX opt
1 year ago
gxw
8dea25ffff
LoongArch64: Fixed utest kernel_regress:skx_avx
1 year ago
gxw
990507e3b8
LoongArch64: Opt zgemv with LASX
1 year ago
gxw
d51ffec3a2
LoongArch64: Opt cgemv with LASX
1 year ago
pengxu
4787a55c64
Optimized cgemm kernel 16x4 LASX for LoongArch
1 year ago
pengxu
fe3da43b7d
Optimized zgemm kernel 8*4 LASX, 4*4 LSX and cgemm kernel 8*4 LSX for LoongArch
2 years ago
Martin Kroeker
b537528feb
Merge pull request #4480 from XiWeiGu/loongarch64-fixed-{s/d}amin-lsx
LoongArch64: Fixed {s/d}amin LSX optimization
2 years ago
gxw
adde725321
LoongArch64: Fixed {s/d}amin LSX optimization
2 years ago
gxw
7bc93d95a1
LoongArch64: Opt {c/z}axpby
2 years ago
gxw
1e1f487dc7
LoongArch64: Fixed {s/d}axpby
2 years ago
Martin Kroeker
98c9ff3194
Merge pull request #4464 from XiWeiGu/loongarch64-zscal
LoongArch64: Handle NAN and INF
2 years ago
gxw
83ce97a4ca
LoongArch64: Handle NAN and INF
2 years ago
gxw
a79d117405
LoongArch64: Fixed bug for {s/d}amin
2 years ago
gxw
276e3ebf9e
LoongArch64: Add dzamax and dzamin opt
2 years ago