tingbo.liao
|
0bea1cfd9d
|
Optimize the zgemm_tcopy_4_rvv function to be compatible with the situations where the vector lengths(vlens) are 128 and 256.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
|
1 year ago |
tingbo.liao
|
d00cc400b1
|
Replaced the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
|
1 year ago |
Martin Kroeker
|
229d8a025e
|
Merge pull request #4959 from CDAC-Bengaluru/level-1-sve
SVE Implementation for Level-1 BLAS Routines
|
1 year ago |
SushilPratap04
|
3368a4e697
|
Update swap_kernel_sve.c
|
1 year ago |
CDAC-SSDG
|
dd71e4234a
|
Added Updated swap and rot sve kernels.
|
1 year ago |
CDAC-SSDG
|
06ffd411a5
|
Update KERNEL.ARMV8SVE
|
1 year ago |
CDAC-SSDG
|
41912f9c22
|
Update CONTRIBUTORS.md
|
1 year ago |
CDAC-SSDG
|
765850194e
|
Delete kernel/arm64/swap_kernel_sve.c
|
1 year ago |
CDAC-SSDG
|
c17c19fbcf
|
Delete kernel/arm64/swap_kernel_c.c
|
1 year ago |
CDAC-SSDG
|
f6416c0e37
|
Delete kernel/arm64/swap.c
|
1 year ago |
CDAC-SSDG
|
3b7b74664c
|
Delete kernel/arm64/scal_kernel_sve.c
|
1 year ago |
CDAC-SSDG
|
95a97012e8
|
Delete kernel/arm64/scal_kernel_c.c
|
1 year ago |
CDAC-SSDG
|
5540f2121e
|
Delete kernel/arm64/scal.c
|
1 year ago |
CDAC-SSDG
|
f62519cc87
|
Delete kernel/arm64/rot_kernel_sve.c
|
1 year ago |
CDAC-SSDG
|
10857c9df4
|
Delete kernel/arm64/rot_kernel_c.c
|
1 year ago |
CDAC-SSDG
|
b9f51a5cf7
|
Delete kernel/arm64/rot.c
|
1 year ago |
Martin Kroeker
|
89f02ed394
|
Merge pull request #5014 from martin-frbg/issue5013
Add some missed lapack 3.11+ symbols to gensymbol
|
1 year ago |
Martin Kroeker
|
61d5aec7c1
|
remove typo
|
1 year ago |
Martin Kroeker
|
5aea097df0
|
add missing lapack 3.11+ symbols
|
1 year ago |
Martin Kroeker
|
72f7b7011c
|
Merge pull request #5009 from martin-frbg/pybenchdoc
DOCS, pybench : Add build notes for Windows and flang from gh Discussion 5008
|
1 year ago |
Martin Kroeker
|
0f8ff82592
|
Add build notes for Windows and flang from gh Discussion 5008
|
1 year ago |
Martin Kroeker
|
81666de4ef
|
Merge pull request #5007 from martin-frbg/issue5006
Revert the NRM2 kernels for NeoverseN2 and ARMV8SVE targets to the generic NEON version
|
1 year ago |
Martin Kroeker
|
230e665bca
|
Merge pull request #4996 from iha-taisei/sdgemv_sve_unroll
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1
|
1 year ago |
Martin Kroeker
|
3345007d8f
|
retire the thunderx2 NRM2 kernels due to reported inaccuracies and NAN
|
1 year ago |
Martin Kroeker
|
5fe983db29
|
retire the thunderx2 nrm2 kernels for now due to NAN and inaccuracies
|
1 year ago |
Martin Kroeker
|
5dc4d7dd7e
|
Merge pull request #5005 from martin-frbg/evbarm
Improve support for NetBSD on arm64 (evbarm)
|
1 year ago |
Martin Kroeker
|
4ba471dd5a
|
Merge pull request #5003 from mathomp4/bugfix/nag-pic
Fixes for NAG Compiler
|
1 year ago |
Martin Kroeker
|
a791912cbb
|
handle uname returning evbarm on NetBSD
|
1 year ago |
Martin Kroeker
|
1a6ecda398
|
utilize /proc/cpuinfo on NetBSD too
|
1 year ago |
Matthew Thompson
|
c4e8bac5a5
|
Fix indent
|
1 year ago |
Matthew Thompson
|
d3b2036d49
|
Move to use ERROR STOP instead of ABORT
|
1 year ago |
Matthew Thompson
|
35334ed2ea
|
Fixes for Fortran Standards violations for lapack-netlib
|
1 year ago |
Matthew Thompson
|
be19966d3b
|
Fixes for NAG CMake
|
1 year ago |
Martin Kroeker
|
9c5d20187b
|
Merge pull request #4999 from dg0yt/macro-failed
Fix redefinition of FAILED
|
1 year ago |
Matthew Thompson
|
2eaf285de5
|
Use F_COMPILER name
|
1 year ago |
Matthew Thompson
|
a8b1705dbd
|
CMake build has wrong PIC flag for NAG
|
1 year ago |
Martin Kroeker
|
5f65846691
|
Merge pull request #4998 from dg0yt/arm-type-function
arm: Declare symbols as .type function
|
1 year ago |
Kai Pastor
|
93eb42fdc8
|
Fix redefinition of FAILED
|
1 year ago |
Kai Pastor
|
dc905636d1
|
arm: Declare symbols as .type function
|
1 year ago |
Iha, Taisei
|
4918beecbe
|
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1
|
1 year ago |
Martin Kroeker
|
0578a89afd
|
Merge pull request #4993 from martin-frbg/issue4991
Translate CMAKE_SYSTEM_NAME in compilations on or for IOS
|
1 year ago |
Martin Kroeker
|
57a51d74c9
|
translate CMAKE_SYSTEM_NAME in compilations on or for IOS
|
1 year ago |
Martin Kroeker
|
35f2e6afe6
|
Merge pull request #4992 from mmuetzel/ci-msys2
CI (MinGW): Remove CLANG32 environment from build matrix.
|
1 year ago |
Markus Mützel
|
f5e6b5b5c9
|
CI (MinGW): Remove CLANG32 environment from build matrix.
The CLANG32 environment is in the process of being removed from MSYS2
currently:
https://www.msys2.org/news/#2024-09-23-starting-to-drop-the-clang32-environment
Remove it from the build matrix ahead of its complete removal from MSYS2.
|
1 year ago |
Martin Kroeker
|
8e8003a2d1
|
Merge pull request #4180 from mmuetzel/cmake
CI (MinGW): Remove work-around needed for old versions of LLVM Flang
|
1 year ago |
Martin Kroeker
|
71963a7bc4
|
Merge pull request #4985 from CheryDan/RISCV/sched
added optimizations for RISC-V YIELDING
|
1 year ago |
Markus Mützel
|
7452af4471
|
CI (MinGW): Remove work-around with NO_AVX512 that was needed for older versions of LLVM Flang.
|
2 years ago |
Martin Kroeker
|
82088cb266
|
Merge pull request #4986 from martin-frbg/readme_compilers
Add compiler version notes and mention the f2c fallback LAPACK in the README
|
1 year ago |
Martin Kroeker
|
8481301f1a
|
Merge pull request #4987 from martin-frbg/issue3973
Update build instructions for WoA (use LLVM19 and its flang-new)
|
1 year ago |
Martin Kroeker
|
009c1e0387
|
fix download link for the current WoA binary of LLVM
|
1 year ago |