b3ffd5524
Include NEON header for the bfloat conversion functions by
2025-08-04 00:20:28 -0700
52792f6da
(refs/pull/5413/head, revert-5180-openmp_use_cmake)
Revert "CMake: Pass `OpenMP` compiler and linker flags through CMake targets" by
2025-07-31 20:36:01 +0200
d23680b81
Merge pull request #5407 from nakagawa-fj/feature/gemm_divide_rate_for_neoversev1 by
2025-07-30 13:19:50 -0700
51ee3812f
Update to 20.1.8 by
2025-07-30 22:17:08 +0200
b4cc4be2c
Merge pull request #5410 from martin-frbg/issue5404 by
2025-07-30 12:16:05 -0700
0968dddf1
Merge pull request #5409 from martin-frbg/issue5372 by
2025-07-30 10:36:39 -0700
7047e8f07
deploy: eddfe1e6b3 by
2025-07-30 16:18:05 +0000
eddfe1e6b
Merge pull request #5408 from ChipKerchner/fixRISCV64GEMVInitializationAndWarnings by
2025-07-30 08:43:08 -0700
30d11bc92
(refs/pull/5410/head)
Adjust multithreading threshold and add an intermediate step by
2025-07-30 08:13:33 -0700
a3b9c933c
(refs/pull/5409/head)
mark xbuffer as volatile to work around gcc15.1 optimizer bug by
2025-07-30 17:05:36 +0200
72f082f31
(refs/pull/5408/head)
Fix bad vector zero initializer and other compiler warnings for RISC-V. by
2025-07-30 14:04:43 +0000
7e29f1139
(refs/pull/5407/head)
Multi-thread GEMM Performance Improvement on NeoverseV1 (DIVIDE_RATE=1) by
2025-07-29 18:54:36 +0900
665b6a048
deploy: 9a64b32b44 by
2025-07-29 06:17:55 +0000
9a64b32b4
Merge pull request #5406 from martin-frbg/fixbgemmtest by
2025-07-28 23:17:29 -0700
b66a01f90
(refs/pull/5406/head)
Fix building of bgemm tests on GEMM3M-capable (x86) targets by
2025-07-28 22:43:28 +0200
0aa2a5466
deploy: a5e7c0e3e0 by
2025-07-28 20:39:38 +0000
a5e7c0e3e
Merge pull request #5396 from abhishek-iitmadras/abhishekk_bfloat16 by
2025-07-28 13:39:08 -0700
6356190d0
(refs/pull/5396/head)
fix gfortran link path in dynamic_arch.yml by
2025-07-28 14:37:29 +0530
b8e6dafc5
(refs/pull/5405/head)
temporary change to host-specific build by
2025-07-27 10:27:27 +0200
809bd9d3c
temporary upgrade to a Graviton4 instance for V2 build testing by
2025-07-27 10:24:27 +0200
4c8dcb3a8
Darwin/arm64: disable SVE/SME and fix gfortran link path by
2025-07-26 16:59:46 +0530
33b50548e
Merge pull request #5403 from martin-frbg/issue5402 by
2025-07-25 20:10:47 +0200
c504aedca
Merge pull request #5400 from Mousius/neoversev2-target by
2025-07-25 15:47:06 +0200
b9e107932
(refs/pull/5400/head)
add NeoverseV2 by
2025-07-25 15:44:34 +0200
2f89a5970
fix NeoverseV2 typo by
2025-07-25 15:43:37 +0200
a9e8fa06b
(refs/pull/5403/head)
Introduce a (crude) threshold to multithreading by
2025-07-25 15:15:46 +0200
b4c2b34a4
Merge pull request #5401 from martin-frbg/followup-5397 by
2025-07-25 13:56:13 +0200
c9204f7b6
Merge pull request #5399 from Mousius/bgemm-8x4 by
2025-07-25 11:20:52 +0200
cc6055270
deploy: a55e65dba9 by
2025-07-25 07:29:18 +0000
a55e65dba
Merge pull request #5391 from martin-frbg/issue5387 by
2025-07-25 09:28:46 +0200
0bc79da58
add neon header by
2025-07-25 11:10:20 +0530
720a4743b
update contribution list by
2025-07-23 17:41:34 +0530
05fc88180
ARM64: Enable bfloat16 kernels by default by
2025-05-19 18:34:38 +0530
965463f17
(refs/pull/5401/head)
Include float-bfloat conversion functions in ONLY_CBLAS builds as well by
2025-07-24 23:33:20 +0200
4272cf8c7
Merge pull request #5398 from martin-frbg/fixup-5394 by
2025-07-24 23:29:39 +0200
87247daad
Add NEOVERSEV2 target support by
2025-07-24 11:30:43 +0000
ea2faf0c9
(refs/pull/5399/head)
Add optimized BGEMM for NEOVERSEN2 target by
2025-07-21 17:09:47 +0000
a5b55f6fe
(refs/pull/5398/head)
remove CBLAS restriction on GEMM_GEMV forwarding by
2025-07-24 09:30:58 +0200
8f3b46011
deploy: a4f4662459 by
2025-07-24 07:28:54 +0000
a4f466245
Merge pull request #5397 from omegacoleman/fix-cblas-bgemm by
2025-07-24 09:28:25 +0200
82954ba4c
Update ?GEMM-to-?GEMV forwarding settings by
2025-07-23 23:24:42 +0200
392d38168
Merge pull request #5394 from Mousius/optimize-bgemv by
2025-07-23 23:13:44 +0200
41f9701eb
(refs/pull/5397/head)
Fix cmake building with cblas_bgemm by
2025-07-23 21:51:30 +0800
341f3e8bc
deploy: f4caa61e47 by
2025-07-23 12:50:04 +0000
f4caa61e4
Merge pull request #5395 from martin-frbg/fixloongsonCI by
2025-07-23 14:36:30 +0200
444d03db9
(refs/pull/5395/head)
switch to another site that still has libffi6 (for now) by
2025-07-23 14:04:11 +0200
06ced6da1
(refs/pull/5393/head)
Bump xuantie toolchains V3.1.0 for c910v by
2025-07-23 17:36:40 +0800
2c3cdaf74
(refs/pull/5394/head)
Optimized BGEMV for NEOVERSEV1 target by
2025-07-21 17:09:47 +0000
6144004e9
Fix xtheadvector compilation by
2025-07-23 16:49:24 +0800
4a94ef57e
Bump xuantie toolchains V3.0.2 for c910v by
2025-07-23 01:34:31 +0800
7d908564f
(refs/pull/5391/head)
Use OpenBLAS_ROOT_DIR in CMake config file generation only if set by
2025-07-22 16:01:46 +0200
2f81d6e60
Merge pull request #5390 from martin-frbg/issue5388-2 by
2025-07-22 13:05:14 +0200
e2d941e9a
(refs/pull/5390/head)
Declare the "small" kernel static in addition to inline by
2025-07-22 11:02:32 +0200
821470093
Declare the "small" kernel static in addition to inline by
2025-07-22 11:01:37 +0200
ff4f949a7
deploy: 4ae8707b54 by
2025-07-22 08:58:28 +0000
4ae8707b5
Merge pull request #5389 from martin-frbg/issue5388 by
2025-07-22 10:57:59 +0200
b24212f5d
(refs/pull/5389/head)
fix numbers by
2025-07-21 22:54:52 +0200
6ff06f548
Add cross-compilation data for RISCV64 targets by
2025-07-21 22:42:15 +0200
2049628f2
(refs/pull/5318/head)
Enable lapack+OpenMP on MinGW-w64. by
2025-06-19 12:48:11 +0100
14f74d2bb
Don't rename symbols on MinGW-w64 by
2025-06-19 12:47:09 +0100
b537c1be4
(refs/pull/5187/head)
Add files via upload by
2025-07-20 09:23:33 +0200
b7e55475a
fix checks by
2025-07-20 08:45:39 +0200
c59a6194b
Merge branch 'OpenMathLib:develop' into gemmt_tests by
2025-07-19 13:21:18 +0200
d8b3bdf7a
cleanup by
2025-07-19 13:20:47 +0200
ff16fb4f7
deploy: d92f151634 by
2025-07-19 06:47:16 +0000
d92f15163
Merge pull request #5386 from martin-frbg/issue5384 by
2025-07-19 08:33:51 +0200
30dbca505
(refs/pull/5386/head)
fix misleading indentation to silence a gcc warning by
2025-07-18 23:51:04 +0200
38e699929
format cleanup by
2025-07-18 23:45:08 +0200
3df503caf
portability fix and cleanup by
2025-07-18 23:41:57 +0200
4e0cf1ecc
Merge branch 'OpenMathLib:develop' into gemmt_tests by
2025-07-18 23:26:06 +0200
d7d6e6b53
Adjust tests to conform to the behavior now codified by the Reference BLAS by
2025-07-18 23:25:56 +0200
b5f1223a4
deploy: 39c90f9859 by
2025-07-18 21:24:14 +0000
39c90f985
Merge pull request #5380 from quic/topic/sgemm_direct_sme1_alpha_beta by
2025-07-18 23:23:39 +0200
eae0abfdb
(refs/pull/5380/head)
SME1 based direct kernel with alpha and beta for cblas_sgemm level 3 API. by
2025-07-11 14:51:16 +0530
2b8fe330a
deploy: ac8cbfdd8e by
2025-07-16 21:22:33 +0000
ac8cbfdd8
Merge pull request #5381 from Mousius/bgemv-infrastructure by
2025-07-16 23:22:08 +0200
287963743
deploy: 08df0f02d9 by
2025-07-15 19:29:59 +0000
1742decdc
Merge pull request #5375 from lowkeyrossi/CI_for_WoA by
2025-07-15 21:16:03 +0200
08df0f02d
Merge pull request #5382 from martin-frbg/issue5379 by
2025-07-15 21:07:34 +0200
7d7757acd
(refs/pull/5382/head)
Update cross-compilation instructions for the Android NDK by
2025-07-15 18:25:55 +0200
947d7af4c
(refs/pull/5381/head)
Fix CMake references to bscal and bgemv by
2025-07-15 14:41:19 +0000
72d2ebb4d
Re-add GEMV fallback for Level3 by
2025-07-15 15:00:20 +0100
e10541146
Add infrastructure for bgemv/bscal by
2025-07-13 15:25:07 +0000
666e1081a
Merge pull request #5378 from martin-frbg/cpuid_lunarlake by
2025-07-13 23:18:22 +0200
5a0c2b30d
deploy: 3ea6322eff by
2025-07-13 21:04:03 +0000
3ea6322ef
Merge pull request #5377 from Mousius/test-fixes by
2025-07-13 23:03:35 +0200
848e9e6ba
(refs/pull/5378/head)
Add ID data for Intel Lunar Lake ("Core Ultra 200V series") by
2025-07-13 20:34:19 +0200
09a016fdf
Split sbgemv test from sbgemm test by
2025-07-13 13:01:27 +0000
3f110c827
(refs/pull/5377/head)
Improve bgemm and sbgemm testing by
2025-07-13 12:48:09 +0000
cb2c72671
(refs/pull/5375/head)
Add CI support for OpenBLAS on WoA by
2025-07-12 14:37:30 +0530
c8d41e4a3
Add CI support for OpenBLAS on WoA by
2025-07-12 14:34:29 +0530
81b30d453
Merge pull request #5374 from martin-frbg/fixup-5373 by
2025-07-11 15:33:38 +0200
aad97c776
(refs/pull/5374/head)
Fix return type declaration by
2025-07-11 15:32:41 +0200
f2ee10172
deploy: 7acb122a98 by
2025-07-11 09:57:26 +0000
7acb122a9
Merge pull request #5373 from Mousius/bgemm-optimized by
2025-07-11 11:56:56 +0200
740efd71c
(refs/pull/5373/head)
Add optimized BGEMM kernel for NEOVERSEV1 target by
2025-07-10 23:23:27 +0000
e927373f6
Merge pull request #5371 from martin-frbg/fixup-5357 by
2025-07-10 16:38:37 +0200
9a272fece
(refs/pull/5371/head)
Re-enable the BGEMM tests by
2025-07-10 15:02:59 +0200
b54aec804
remove spurious include by
2025-07-10 15:00:30 +0200
343830c26
Add BGEMM parameter tables by
2025-07-10 14:59:46 +0200