fd3afef12
(refs/pull/5218/head)
lapacke_mangling.h is no longer generated, so don't delete on make clean by
2025-04-10 22:09:19 +0200
b30dc9701
Merge pull request #5215 from annop-w/gemv_t by
2025-04-10 13:06:07 -0700
2893d0add
Merge pull request #5211 from guoyuanplct/develop by
2025-04-10 09:43:03 -0700
f409ee030
deploy: ed1e470663 by
2025-04-10 15:20:15 +0000
ed1e47066
Merge pull request #5217 from haampie/hs/fix/darwin-gcc by
2025-04-10 08:19:46 -0700
3d6d026fe
(refs/pull/5217/head)
no-gcse when loongarch64 by
2025-04-10 15:44:31 +0200
51ba70f47
test_potrs.c: remove pragma darwin-aarch64 support by
2025-04-10 15:20:34 +0200
ec146157d
(refs/pull/5215/head)
Use SVE kernel for S/DGEMVT for SVE machines by
2025-04-02 09:11:58 +0000
de2380e5a
Merge pull request #5214 from martin-frbg/issue5200 by
2025-04-09 10:37:52 -0700
a34b487f2
(refs/pull/5214/head)
Remove spurious cast from Alpha and Cell's DEFAULT_ALIGN by
2025-04-09 17:25:46 +0200
1b3e7cc49
Merge pull request #5212 from martin-frbg/lapack1119 by
2025-04-09 04:37:14 -0700
9056b811f
deploy: 4270d5bc43 by
2025-04-09 08:55:54 +0000
4270d5bc4
Merge pull request #5204 from martin-frbg/issue4692 by
2025-04-09 01:48:31 -0700
880e43ee5
Merge pull request #5198 from martin-frbg/woadlldebug by
2025-04-08 14:22:51 -0700
70865a894
Merge pull request #5180 from ywwry66/openmp_use_cmake by
2025-04-08 13:16:07 -0700
77b14e067
deploy: f0f274725d by
2025-04-08 14:30:30 +0000
f0f274725
Merge pull request #5207 from martin-frbg/issue5202 by
2025-04-08 07:14:16 -0700
94fb7033a
(refs/pull/5212/head)
Fix incomplete error message (Reference-LAPACK PR 1119) by
2025-04-08 07:03:11 -0700
1ff303f36
(refs/pull/5211/head)
Optimizing the Implementation of GEMV on the RISC-V V Extension by
2025-04-08 21:18:00 +0800
fc8090b60
(refs/pull/5207/head)
Move additional omp dependency to EXTRALIB by
2025-04-08 11:54:36 +0200
1c5d0d553
move libomp to extralib by
2025-04-08 10:44:36 +0200
198319a24
deploy: 67c5bdd639 by
2025-04-07 19:24:10 +0000
67c5bdd63
Azure CI: Update flang call in OSX_LLVM_flangnew job (#5208) by
2025-04-07 12:20:43 -0700
eebedaeeb
(refs/pull/5208/head)
Update azure-pipelines.yml by
2025-04-07 21:19:54 +0200
83fc81fe5
Update azure-pipelines.yml by
2025-04-07 17:49:03 +0200
cee76bf59
Update azure-pipelines.yml by
2025-04-07 16:23:28 +0200
8a172486c
Update azure-pipelines.yml by
2025-04-07 15:34:42 +0200
96795e0f4
Update azure-pipelines.yml by
2025-04-07 15:24:20 +0200
02c6bce8e
Update flang call in OSX_LLVM_flangnew job by
2025-04-07 14:47:41 +0200
1ed962d25
Fix compilation with xcode16.3/clang17/gcc14 by
2025-04-06 10:44:48 -0700
7e0af691b
deploy: f0008f50cc by
2025-04-05 21:44:00 +0000
f0008f50c
Merge pull request #5206 from ColumbusAI/develop by
2025-04-05 23:43:31 +0200
7bf848454
(refs/pull/5206/head)
Update zsum.c -- fixed spelling error to successfully compile by
2025-04-05 09:57:53 -0700
0aa5ef29e
(refs/pull/5204/head)
Repeat the libs target's "ln" in the all target to ensure completeness by
2025-04-03 23:54:56 +0200
0be743d5b
deploy: f90eff306d by
2025-04-03 16:11:09 +0000
f90eff306
Merge pull request #5197 from e4t/z-arch-exec-stack by
2025-04-03 18:10:41 +0200
04915be82
(refs/pull/5203/head)
Add vector registers to clobber list to prevent compiler optimization. by
2025-04-03 12:18:43 +0530
3fc15ad81
(refs/pull/5198/head)
Fix pdb file creation in debug dll builds with CMake on Windows/WoA by
2025-03-30 23:22:09 +0200
61b9339d3
(refs/pull/5197/head)
getarch/cpuid.S: Fix warning about executable stack by
2025-03-28 08:59:26 +0100
ea6515c4b
On zarch don't produce objects from assembler with a writable stack section by
2025-03-26 17:35:21 +0100
dd77d3844
deploy: f33943d73e by
2025-03-27 08:18:23 +0000
f33943d73
Merge pull request #5196 from martin-frbg/issue5193 by
2025-03-27 09:17:54 +0100
251c3f857
(refs/pull/5180/head)
gh m1: fix mixed linkage when built with OpenMP and clang+gfortran by
2025-03-26 23:19:40 -0400
1b0c0f00e
CMake: Avoid mixed OpenMP linkage by
2025-03-13 02:25:52 -0400
02fd1df10
CMake: Pass `OpenMP` compiler and linker flags through CMake targets by
2025-03-12 20:41:55 -0400
8b3553420
Merge pull request #5195 from martin-frbg/update-gensymbolpl by
2025-03-26 23:39:53 +0100
51c1fb1f9
(refs/pull/5196/head)
Fix ?spmv build and misinterpretation of NO_LAPACK=0 by
2025-03-26 23:36:49 +0100
3ca1ba1be
(refs/pull/5195/head)
resynchronize with the posix shell version by
2025-03-26 18:37:11 +0100
dc9bb4fd8
(refs/pull/5194/head)
FIX by
2025-03-27 01:11:44 +0800
b46138073
deploy: 72f0abeed5 by
2025-03-26 10:22:33 +0000
72f0abeed
Merge pull request #5191 from Harishmcw/CMake_Symbol_Fix by
2025-03-26 11:22:07 +0100
1724b3f10
(refs/pull/5191/head)
DLL symbol pre/postfixing in CMake builds by
2025-03-26 10:55:50 +0530
c2e7ab535
DLL symbol pre/postfixing in CMake builds by
2025-03-26 10:50:29 +0530
200771078
Merge pull request #5190 from Harishmcw/develop by
2025-03-25 22:32:43 +0100
360a0c3dd
deploy: 4e3afa7beb by
2025-03-25 21:03:19 +0000
4e3afa7be
Merge pull request #5175 from shubhamsvc/dgemv_thread_throttling by
2025-03-25 22:02:48 +0100
c0a5c9655
(refs/pull/5190/head)
Fix missing commas in gensymbol.pl by
2025-03-24 13:49:55 +0530
030bfd1b3
Remove unused and conflicting declarations from the f2c preamble by
2025-03-21 09:21:16 +0100
140da0c8f
Fix f2c conversion errors by
2025-03-20 22:27:05 +0100
cf4c5a6d8
Update f2c-translated stand-ins to include GEMMTR by
2025-03-20 20:20:41 +0100
d1d3342fe
Restore OpenBLAS version of header and add GEMMTR by
2025-03-20 15:44:59 +0100
9fe2784b0
Delete non-applicable header entries from Reference-LAPACK by
2025-03-20 11:44:10 +0100
53e8e569a
(refs/pull/5188/head)
Fix missing quotes around variables that might be empty by
2025-03-20 11:12:21 +0100
a9d24e6cb
Fix source files for gemmtr and sbgemmt by
2025-03-20 11:10:55 +0100
40e1e58e9
Fix DLL symbol name pre/postfixing on Windows by
2025-03-19 22:52:41 +0100
088f3b435
Update CBLAS3 tests from Reference-LAPACK to add GEMMT(R) testing by
2025-03-19 22:41:20 +0100
cfb7685a7
Add cblas_?gemmtr aliases of cblas_?gemmt by
2025-03-19 22:36:56 +0100
8e289ecdd
(refs/pull/5175/head)
Simplified thread throttling function in gemv by
2025-03-18 13:24:05 +0530
189dbbc04
Add thread throttling for dynamic arch neoversev1 by
2025-03-04 16:08:55 +0530
b6cb5ece5
Add thread throttling profile for DGEMV on NEOVERSEV1 by
2025-02-28 13:10:40 +0530
64a6bc16b
deploy: 51c244a098 by
2025-03-15 16:25:59 +0000
51c244a09
Merge pull request #5184 from taoye9/fix_sbgemv_n_bug by
2025-03-15 17:25:33 +0100
f27ba5efd
(refs/pull/5184/head)
fix bugs in aarch64 sbgemv_n kernel by
2025-03-14 17:55:40 +0000
e9fbe0a83
Merge pull request #5183 from annop-w/fix_sbgemv_t by
2025-03-13 23:04:09 +0100
edef2e444
(refs/pull/5183/head)
Fix bug in ARM64 sbgemv_t by
2025-03-13 20:55:31 +0000
b55ca71d5
Merge pull request #5182 from annop-w/sgemm_ncopy by
2025-03-13 16:04:39 +0100
2f778554b
Merge pull request #5181 from taoye9/change_sbgemn_cast_bf16 by
2025-03-13 13:50:26 +0100
acd9975d6
deploy: 66e0f1e621 by
2025-03-13 11:05:23 +0000
66e0f1e62
Merge pull request #5178 from martin-frbg/lapack_cplx_dummy by
2025-03-13 11:57:29 +0100
9807f5658
(refs/pull/5182/head)
Optimize aarch64 sgemm_ncopy by
2025-03-12 21:26:27 +0000
43f413748
deploy: 1ba02656e6 by
2025-03-13 06:34:21 +0000
1ba02656e
Merge pull request #5177 from martin-frbg/cmakelapacke by
2025-03-13 07:33:52 +0100
8a418b1aa
(refs/pull/5178/head)
Add dummy implementations for the LAPACK_COMPLEX_CUSTOM case by
2025-03-12 23:20:16 +0100
b34235ca6
(refs/pull/5177/head)
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3 by
2025-03-12 22:41:50 +0100
37b854769
Merge pull request #5173 from nakagawa-fj/gemm_load_imbalance by
2025-03-12 22:38:02 +0100
a3e7b1607
Merge pull request #5157 from manaalmj/feature by
2025-03-12 21:08:23 +0100
67156a641
deploy: 8865850496 by
2025-03-12 17:50:21 +0000
886585049
Merge pull request #5176 from annop-w/fix_sbgemv_t by
2025-03-12 18:49:54 +0100
4c00099ed
(refs/pull/5181/head)
replace customize bf16_to_fp32 with arm neon vcvtah_f32_bf16 by
2025-03-12 16:20:15 +0000
a085b6c9e
(refs/pull/5176/head)
Fix aarch64 sbgemv_t compilation error for GCC < 13 by
2025-03-12 14:49:10 +0000
80d3c2ad9
(refs/pull/5173/head)
Add Improving Load Imbalance in Thread-Parallel GEMM by
2025-03-11 20:18:20 +0900
5c4e38ab1
(refs/pull/5157/head)
Optimize gemv_n_sve kernel by
2025-02-27 09:39:06 +0000
27adf9f80
deploy: 39eb43d441 by
2025-03-07 12:58:24 +0000
39eb43d44
Improve thread safety of pthreads builds that rely on C11 atomic operations for locking (#5170) by
2025-03-07 13:48:28 +0100
3a3318006
(refs/pull/5170/head)
Use atomic acquire on load, release on store by
2025-03-07 10:31:33 +0100
6610db4eb
switch to full ACQ_REL semantics by
2025-03-04 22:37:51 +0100
98206dbdb
Tighten memory orders for C11 atomic operations by
2025-03-04 20:04:22 +0100
8c65ea4ed
deploy: 1d5ed5c46b by
2025-03-04 15:39:58 +0000
1d5ed5c46
Merge pull request #5168 from taoye9/add_sbgemvn_on_neonversen2 by
2025-03-04 16:39:22 +0100
7338a473a
Merge pull request #5150 from Harishmcw/WoA-Experiments by
2025-03-03 21:45:53 +0100