You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
yancheng d32f38fb37 loongarch64: Add optimizations for nrm2. 2 years ago
..
KERNEL LoongArch64: Add DYNAMIC_ARCH support 3 years ago
KERNEL.LOONGSON2K1000 loongarch64: Add optimizations for nrm2. 2 years ago
KERNEL.LOONGSON3R5 loongarch64: Add optimizations for nrm2. 2 years ago
KERNEL.generic LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2 2 years ago
Makefile Add support for LOONGARCH64 4 years ago
amax.S Add support for LOONGARCH64 4 years ago
amin.S Add support for LOONGARCH64 4 years ago
asum.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
cnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2 years ago
copy.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
damax_lasx.S loongarch64: Add optimizations for amax. 2 years ago
damax_lsx.S loongarch64: Add optimizations for amax. 2 years ago
damin_lasx.S loongarch64: Add optimization for amin. 2 years ago
damin_lsx.S loongarch64: Add optimization for amin. 2 years ago
dasum_lasx.S loongarch64: Add optimizations for sum and asum. 2 years ago
dasum_lsx.S loongarch64: Add optimizations for sum and asum. 2 years ago
daxpby_lasx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
daxpby_lsx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
daxpy_lasx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
daxpy_lsx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
dcopy_lasx.S loongarch64: Add optimizations for copy. 2 years ago
dcopy_lsx.S loongarch64: Add optimizations for copy. 2 years ago
dgemm_kernel_16x4.S LoongArch64: Update dgemm kernel 2 years ago
dgemm_ncopy_4.S loongarch64: Optimize dgemm_kernel 4 years ago
dgemm_ncopy_16.S loongarch64: Optimize dgemm_kernel 4 years ago
dgemm_tcopy_4.S loongarch64: Optimize dgemm_kernel 4 years ago
dgemm_tcopy_16.S loongarch64: Optimize dgemm_kernel 4 years ago
dgemv_n_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2 years ago
dgemv_t_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2 years ago
dmax_lasx.S loongarch64: Add optimization for max. 2 years ago
dmax_lsx.S loongarch64: Add optimization for max. 2 years ago
dmin_lasx.S loongarch64: Add optimization for min. 2 years ago
dmin_lsx.S loongarch64: Add optimization for min. 2 years ago
dnrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2 years ago
dnrm2_lasx.S loongarch64: Add optimizations for nrm2. 2 years ago
dnrm2_lsx.S loongarch64: Add optimizations for nrm2. 2 years ago
dot.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
dot_lasx.S loongarch: Add optimization for dsdot kernel. 2 years ago
dot_lsx.S loongarch: Add LSX optimization for dot. 2 years ago
drot_lasx.S loongarch64: Add optimizations for rot. 2 years ago
drot_lsx.S loongarch64: Add optimizations for rot. 2 years ago
dscal_lasx.S loongarch64: Add optimizations for scal. 2 years ago
dscal_lsx.S loongarch64: Add optimizations for scal. 2 years ago
dsum_lasx.S loongarch64: Add optimizations for sum and asum. 2 years ago
dsum_lsx.S loongarch64: Add optimizations for sum and asum. 2 years ago
dswap_lasx.S loongarch64: Add optimizations for swap. 2 years ago
dswap_lsx.S loongarch64: Add optimizations for swap. 2 years ago
dtrsm_kernel_LN_16x4_lasx.S LoongArch64: Add dtrsm kernel 2 years ago
dtrsm_kernel_LT_16x4_lasx.S LoongArch64: Add dtrsm kernel 2 years ago
dtrsm_kernel_RN_16x4_lasx.S LoongArch64: Add dtrsm kernel 2 years ago
dtrsm_kernel_RT_16x4_lasx.S LoongArch64: Add dtrsm kernel 2 years ago
dtrsm_kernel_macro.S LoongArch64: Add dtrsm kernel 2 years ago
gemm_kernel.S Add support for LOONGARCH64 4 years ago
gemv_n.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
gemv_t.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
iamax.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
iamin.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
idamax_lasx.S loongarch64: Add optimizations for iamax. 2 years ago
idamax_lsx.S loongarch64: Add optimizations for iamax. 2 years ago
idamin_lasx.S loongarch64: Add optimizations for iamin. 2 years ago
idamin_lsx.S loongarch64: Add optimizations for iamin. 2 years ago
idmax_lasx.S loongarch64: Add optimizations for imax. 2 years ago
idmax_lsx.S loongarch64: Add optimizations for imax. 2 years ago
idmin_lasx.S loongarch64: Add optimizations for imin. 2 years ago
idmin_lsx.S loongarch64: Add optimizations for imin. 2 years ago
isamax_lasx.S loongarch64: Add optimizations for iamax. 2 years ago
isamax_lsx.S loongarch64: Add optimizations for iamax. 2 years ago
isamin_lasx.S loongarch64: Add optimizations for iamin. 2 years ago
isamin_lsx.S loongarch64: Add optimizations for iamin. 2 years ago
ismax_lasx.S loongarch64: Add optimizations for imax. 2 years ago
ismax_lsx.S loongarch64: Add optimizations for imax. 2 years ago
ismin_lasx.S loongarch64: Add optimizations for imin. 2 years ago
ismin_lsx.S loongarch64: Add optimizations for imin. 2 years ago
izamax.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
izamin.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
loongarch64_asm.S LoongArch64: Compatible with early internal toolchain 2 years ago
max.S Add support for LOONGARCH64 4 years ago
min.S Add support for LOONGARCH64 4 years ago
samax_lasx.S loongarch64: Add optimizations for amax. 2 years ago
samax_lsx.S loongarch64: Add optimizations for amax. 2 years ago
samin_lasx.S loongarch64: Add optimization for amin. 2 years ago
samin_lsx.S loongarch64: Add optimization for amin. 2 years ago
sasum_lasx.S loongarch64: Add optimizations for sum and asum. 2 years ago
sasum_lsx.S loongarch64: Add optimizations for sum and asum. 2 years ago
saxpby_lasx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
saxpby_lsx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
saxpy_lasx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
saxpy_lsx.S loongarch64: Add optimizations for axpy and axpby. 2 years ago
scal.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
scopy_lasx.S loongarch64: Add optimizations for copy. 2 years ago
scopy_lsx.S loongarch64: Add optimizations for copy. 2 years ago
sgemm_kernel_16x8_lasx.S LoongArch64: Compatible with early internal toolchain 2 years ago
sgemm_ncopy_8_lasx.S LoongArch64: Add sgemm_kernel 2 years ago
sgemm_ncopy_16_lasx.S LoongArch64: Add sgemm_kernel 2 years ago
sgemm_tcopy_8_lasx.S LoongArch64: Add sgemm_kernel 2 years ago
sgemm_tcopy_16_lasx.S LoongArch64: Add sgemm_kernel 2 years ago
sgemv_n_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2 years ago
sgemv_t_8_lasx.S LoongArch64: Fixed compilation issues when enable DYNAMIC_ARCH 2 years ago
smax_lasx.S loongarch64: Add optimization for max. 2 years ago
smax_lsx.S loongarch64: Add optimization for max. 2 years ago
smin_lasx.S loongarch64: Add optimization for min. 2 years ago
smin_lsx.S loongarch64: Add optimization for min. 2 years ago
snrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2 years ago
snrm2_lasx.S loongarch64: Add optimizations for nrm2. 2 years ago
snrm2_lsx.S loongarch64: Add optimizations for nrm2. 2 years ago
srot_lasx.S loongarch64: Add optimizations for rot. 2 years ago
srot_lsx.S loongarch64: Add optimizations for rot. 2 years ago
sscal_lasx.S loongarch64: Add optimizations for scal. 2 years ago
sscal_lsx.S loongarch64: Add optimizations for scal. 2 years ago
ssum_lasx.S loongarch64: Add optimizations for sum and asum. 2 years ago
ssum_lsx.S loongarch64: Add optimizations for sum and asum. 2 years ago
sswap_lasx.S loongarch64: Add optimizations for swap. 2 years ago
sswap_lsx.S loongarch64: Add optimizations for swap. 2 years ago
swap.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
trsm_kernel_LN.S Add support for LOONGARCH64 4 years ago
trsm_kernel_LT.S Add support for LOONGARCH64 4 years ago
trsm_kernel_RT.S Add support for LOONGARCH64 4 years ago
zamax.S Add support for LOONGARCH64 4 years ago
zamin.S Add support for LOONGARCH64 4 years ago
zasum.S Add support for LOONGARCH64 4 years ago
zcopy.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
zdot.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
zgemm3m_kernel.S Add support for LOONGARCH64 4 years ago
zgemm_kernel.S Add support for LOONGARCH64 4 years ago
zgemv_n.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
zgemv_t.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
znrm2.S Allow negative INCX (API change from version 3.10 of the reference implementation) 2 years ago
zscal.S Delete the macro instruction "li" and use "li.d" instead 4 years ago
ztrsm_kernel_LT.S Add support for LOONGARCH64 4 years ago
ztrsm_kernel_RT.S Add support for LOONGARCH64 4 years ago