d5e255519
(refs/pull/5035/head)
Improve OpenBLASConfig.cmake contents by
2024-12-29 22:38:23 +0100
df42f79c4
(refs/pull/5033/head)
docs: update extensions and install pages with last wiki edits by
2024-12-26 21:09:26 +0100
2fd0a129c
(refs/pull/5032/head)
count and sort cpu models on big.little systems by
2024-12-26 10:45:57 +0100
17803e790
Merge pull request #5031 from david-cortes/fix_doc_links by
2024-12-24 23:20:48 +0100
762fa1afa
(refs/pull/5031/head)
fix link to faq by
2024-12-24 19:48:04 +0100
b1a410371
deploy: 6af4e76f31 by
2024-12-24 15:10:54 +0000
6af4e76f3
Merge pull request #5029 from martin-frbg/issue5020 by
2024-12-24 16:10:20 +0100
fbf594b62
(refs/pull/5029/head)
Guard against empty CMAKE_Fortran_COMPILER_ID by
2024-12-24 13:34:33 +0100
c4c3d9e68
(refs/pull/5030/head)
Merge remote-tracking branch 'refs/remotes/origin/develop' into develop by
2024-12-24 10:36:53 +0800
0bea1cfd9
Optimize the zgemm_tcopy_4_rvv function to be compatible with the situations where the vector lengths(vlens) are 128 and 256. by
2024-12-24 10:33:27 +0800
e6fd62977
Expressly declare the .S extension for assembly (documented as standard, but current cmake does not set it for icx) by
2024-12-23 23:18:52 +0100
05fe49dda
Rename local copy functions to avoid name clash with the standard BLAS ones by
2024-12-23 19:12:17 +0100
64c6c7920
Assume no underline suffixes on symbols when compiling with Intel ifx on Windows by
2024-12-23 19:09:34 +0100
5c9417d30
Assume no underline suffixes on symbols when compiling with ifx on Windows by
2024-12-23 19:07:39 +0100
5d81e514e
Assume no underline suffixes on symbols when compiling with ifx on Windows by
2024-12-23 19:06:03 +0100
d78fbe425
Assume no underline suffixes on symbols when compiling with ifx on Windows by
2024-12-23 19:04:50 +0100
30188a55d
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional by
2024-12-23 19:02:34 +0100
32319a33a
Add options for Intel oneAPI 2025.0 ifx on Windows by
2024-12-23 19:00:48 +0100
2247bb54b
(refs/pull/5027/head)
Intel ifx on Windows uses LAPACK_COMPLEX_STRUCTURE by
2024-12-21 23:01:12 +0100
93da64d2b
Fix NEEDBUNDERSCORE conditional by
2024-12-21 22:56:54 +0100
07e304563
fix NEEDBUNDERSCORE conditional for the case it is defined to zero by
2024-12-21 22:55:07 +0100
0d1a27c2d
Intel ifx on Windows does not add underlines; cpuid.S is only needed on ancient i386 Macs by
2024-12-21 22:51:44 +0100
6df18dd59
The Windows version of Intel ifx does not add trailing underlines to symbols by
2024-12-21 22:50:26 +0100
eab9a77c1
Rename the local utility function to my_?copy to avoid symbol clash with the BLAS function by
2024-12-21 22:42:50 +0100
965e9bd56
deploy: 37a4ca7e46 by
2024-12-20 07:31:24 +0000
37a4ca7e4
Merge pull request #5025 from martin-frbg/nvidia_arm64 by
2024-12-19 23:30:53 -0800
1c4401ebf
(refs/pull/5025/head)
Add target-specific options to enable SVE with the NVIDIA compiler by
2024-12-19 14:32:24 -0800
9d42d2709
deploy: f2be482d43 by
2024-12-19 07:05:53 +0000
f2be482d4
Merge pull request #5024 from martin-frbg/issue5001 by
2024-12-18 23:05:20 -0800
2dea8992d
deploy: 70dddacb9f by
2024-12-19 00:57:34 +0000
70dddacb9
Merge pull request #5023 from rgommers/fix-warnings by
2024-12-18 16:13:12 -0800
a93d3db34
(refs/pull/5024/head)
fix formatting of WoA section by
2024-12-19 00:53:10 +0100
e46051268
Update WoA build instructions from rewording in issue #5001 by
2024-12-19 00:50:37 +0100
bf107ab6d
deploy: d3cc8c65ed by
2024-12-18 22:30:13 +0000
d3cc8c65e
Merge pull request #5022 from tingboliao/develop by
2024-12-18 14:29:39 -0800
765ad8bcd
(refs/pull/5023/head)
Fix guard around `alloc_hugetlb`, fixes compile warning by
2024-12-18 09:39:07 +0100
48caf2303
Fix build warning about discarding volatile qualifier in memory.c by
2024-12-18 08:53:29 +0100
d00cc400b
(refs/pull/5022/head)
Replaced the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling. by
2024-12-18 08:35:26 +0800
7c6325bee
deploy: 229d8a025e by
2024-12-13 13:21:24 +0000
229d8a025
Merge pull request #4959 from CDAC-Bengaluru/level-1-sve by
2024-12-13 05:20:51 -0800
3368a4e69
(refs/pull/4959/head)
Update swap_kernel_sve.c by
2024-12-13 16:47:58 +0530
dd71e4234
Added Updated swap and rot sve kernels. by
2024-12-13 11:15:29 +0530
06ffd411a
Update KERNEL.ARMV8SVE by
2024-12-13 11:05:47 +0530
41912f9c2
Update CONTRIBUTORS.md by
2024-12-13 11:05:10 +0530
765850194
Delete kernel/arm64/swap_kernel_sve.c by
2024-12-13 11:02:01 +0530
c17c19fbc
Delete kernel/arm64/swap_kernel_c.c by
2024-12-13 11:01:46 +0530
f6416c0e3
Delete kernel/arm64/swap.c by
2024-12-13 11:01:32 +0530
3b7b74664
Delete kernel/arm64/scal_kernel_sve.c by
2024-12-13 11:01:03 +0530
95a97012e
Delete kernel/arm64/scal_kernel_c.c by
2024-12-13 11:00:45 +0530
5540f2121
Delete kernel/arm64/scal.c by
2024-12-13 11:00:12 +0530
f62519cc8
Delete kernel/arm64/rot_kernel_sve.c by
2024-12-13 10:59:35 +0530
10857c9df
Delete kernel/arm64/rot_kernel_c.c by
2024-12-13 10:58:51 +0530
b9f51a5cf
Delete kernel/arm64/rot.c by
2024-12-13 10:58:06 +0530
9ef5f814e
(refs/pull/5011/merge)
Merge 3d282c93c5 into 89f02ed394 by
2024-12-11 17:54:47 +0000
3d282c93c
(refs/pull/5011/head)
Add Arm®v9-A architecture SME SGEMM kernels by
2024-12-09 08:18:20 +0000
b036235f3
Add Arm®v9-A architecture SME target by
2024-12-09 08:18:17 +0000
24502c821
deploy: 89f02ed394 by
2024-12-11 07:10:03 +0000
89f02ed39
Merge pull request #5014 from martin-frbg/issue5013 by
2024-12-10 23:09:33 -0800
61d5aec7c
(refs/pull/5014/head)
remove typo by
2024-12-11 00:41:56 +0100
5aea097df
add missing lapack 3.11+ symbols by
2024-12-10 23:52:05 +0100
59bdbd4fd
(refs/pull/5012/head)
Update arm64_graviton.yml by
2024-12-10 15:09:29 +0100
e9637ac7a
try inlining the small_matrix_permit by
2024-12-10 14:17:18 +0100
3060ebe7f
delete preferred sizes for neov1 by
2024-12-10 01:01:42 +0100
673323ff5
Update arm64_graviton.yml by
2024-12-10 00:15:36 +0100
ed42fa8a3
Update arm64_graviton.yml by
2024-12-09 23:34:49 +0100
508ddbeb5
temporarily upgrade to Graviton4 by
2024-12-09 22:55:20 +0100
e341ca30c
Update arm64_graviton.yml by
2024-12-09 21:15:09 +0100
c3d436b14
Update arm64_graviton.yml by
2024-12-09 20:34:36 +0100
85c3dd1b0
Update arm64_graviton.yml by
2024-12-09 19:53:35 +0100
d2a1054ce
Update arm64_graviton.yml by
2024-12-09 19:24:30 +0100
dc68a48dd
(refs/pull/5010/head)
Include v0p10 header when compiling RVV0.x code with the T-Head toolchain by
2024-12-06 12:00:10 +0100
d6ab04080
deploy: 72f7b7011c by
2024-12-06 10:50:48 +0000
72f7b7011
Merge pull request #5009 from martin-frbg/pybenchdoc by
2024-12-06 02:50:14 -0800
0f8ff8259
(refs/pull/5009/head)
Add build notes for Windows and flang from gh Discussion 5008 by
2024-12-06 01:35:42 -0800
81666de4e
Merge pull request #5007 from martin-frbg/issue5006 by
2024-12-05 14:43:03 -0800
230e665bc
Merge pull request #4996 from iha-taisei/sdgemv_sve_unroll by
2024-12-05 13:36:47 -0800
3345007d8
(refs/pull/5007/head)
retire the thunderx2 NRM2 kernels due to reported inaccuracies and NAN by
2024-12-05 21:12:06 +0100
5fe983db2
retire the thunderx2 nrm2 kernels for now due to NAN and inaccuracies by
2024-12-05 21:09:53 +0100
5dc4d7dd7
Merge pull request #5005 from martin-frbg/evbarm by
2024-12-05 00:02:58 -0800
c7b1e1af7
deploy: 4ba471dd5a by
2024-12-05 00:16:09 +0000
4ba471dd5
Merge pull request #5003 from mathomp4/bugfix/nag-pic by
2024-12-04 15:41:12 -0800
a791912cb
(refs/pull/5005/head)
handle uname returning evbarm on NetBSD by
2024-12-04 15:34:57 -0800
1a6ecda39
utilize /proc/cpuinfo on NetBSD too by
2024-12-04 15:32:26 -0800
c4e8bac5a
(refs/pull/5003/head)
Fix indent by
2024-12-04 12:11:35 -0500
d3b2036d4
Move to use ERROR STOP instead of ABORT by
2024-12-04 12:09:24 -0500
35334ed2e
Fixes for Fortran Standards violations for lapack-netlib by
2024-12-04 10:53:05 -0500
be19966d3
Fixes for NAG CMake by
2024-12-04 10:52:43 -0500
9c5d20187
Merge pull request #4999 from dg0yt/macro-failed by
2024-12-04 07:37:51 -0800
2eaf285de
Use F_COMPILER name by
2024-11-26 15:26:55 -0500
a8b1705db
CMake build has wrong PIC flag for NAG by
2024-11-26 15:21:28 -0500
62098f926
deploy: 5f65846691 by
2024-12-04 09:51:27 +0000
5f6584669
Merge pull request #4998 from dg0yt/arm-type-function by
2024-12-04 01:50:53 -0800
93eb42fdc
(refs/pull/4999/head)
Fix redefinition of FAILED by
2024-12-03 09:45:04 +0100
dc905636d
(refs/pull/4998/head)
arm: Declare symbols as .type function by
2024-12-03 07:42:44 +0100
4918beecb
(refs/pull/4996/head)
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1 by
2024-12-02 18:46:00 +0900
0c440f8a2
(refs/pull/4994/head)
disable multithreading for small workloads by
2024-11-27 23:15:41 +0100
7fe67fd5d
deploy: 0578a89afd by
2024-11-27 10:55:12 +0000
0578a89af
Merge pull request #4993 from martin-frbg/issue4991 by
2024-11-27 02:54:41 -0800
57a51d74c
(refs/pull/4993/head)
translate CMAKE_SYSTEM_NAME in compilations on or for IOS by
2024-11-27 09:52:56 +0100
35f2e6afe
Merge pull request #4992 from mmuetzel/ci-msys2 by
2024-11-26 10:09:29 -0800