Commit Graph

  • * b3ffd5524 Include NEON header for the bfloat conversion functions by Martin Kroeker 2025-08-04 00:20:28 -0700
  • | * 52792f6da (refs/pull/5413/head, revert-5180-openmp_use_cmake) Revert "CMake: Pass `OpenMP` compiler and linker flags through CMake targets" by Martin Kroeker 2025-07-31 20:36:01 +0200
  • |/
  • * d23680b81 Merge pull request #5407 from nakagawa-fj/feature/gemm_divide_rate_for_neoversev1 by Martin Kroeker 2025-07-30 13:19:50 -0700
  • |\
  • | | * 51ee3812f Update to 20.1.8 by Martin Kroeker 2025-07-30 22:17:08 +0200
  • | |/
  • |/|
  • * | b4cc4be2c Merge pull request #5410 from martin-frbg/issue5404 by Martin Kroeker 2025-07-30 12:16:05 -0700
  • |\ \
  • * \ \ 0968dddf1 Merge pull request #5409 from martin-frbg/issue5372 by Martin Kroeker 2025-07-30 10:36:39 -0700
  • |\ \ \
  • | | | | * 7047e8f07 deploy: eddfe1e6b3 by martin-frbg 2025-07-30 16:18:05 +0000
  • * | | | | eddfe1e6b Merge pull request #5408 from ChipKerchner/fixRISCV64GEMVInitializationAndWarnings by Martin Kroeker 2025-07-30 08:43:08 -0700
  • |\ \ \ \ \
  • | | | * | | 30d11bc92 (refs/pull/5410/head) Adjust multithreading threshold and add an intermediate step by Martin Kroeker 2025-07-30 08:13:33 -0700
  • | |_|/ / /
  • |/| | | |
  • | | * | | a3b9c933c (refs/pull/5409/head) mark xbuffer as volatile to work around gcc15.1 optimizer bug by Martin Kroeker 2025-07-30 17:05:36 +0200
  • | |/ / /
  • |/| | |
  • | * | | 72f082f31 (refs/pull/5408/head) Fix bad vector zero initializer and other compiler warnings for RISC-V. by Chip Kerchner 2025-07-30 14:04:43 +0000
  • |/ / /
  • | * / 7e29f1139 (refs/pull/5407/head) Multi-thread GEMM Performance Improvement on NeoverseV1 (DIVIDE_RATE=1) by Masato Nakagawa 2025-07-29 18:54:36 +0900
  • |/ /
  • | * 665b6a048 deploy: 9a64b32b44 by martin-frbg 2025-07-29 06:17:55 +0000
  • * | 9a64b32b4 Merge pull request #5406 from martin-frbg/fixbgemmtest by Martin Kroeker 2025-07-28 23:17:29 -0700
  • |\ \
  • | * | b66a01f90 (refs/pull/5406/head) Fix building of bgemm tests on GEMM3M-capable (x86) targets by Martin Kroeker 2025-07-28 22:43:28 +0200
  • |/ /
  • | * 0aa2a5466 deploy: a5e7c0e3e0 by martin-frbg 2025-07-28 20:39:38 +0000
  • * | a5e7c0e3e Merge pull request #5396 from abhishek-iitmadras/abhishekk_bfloat16 by Martin Kroeker 2025-07-28 13:39:08 -0700
  • |\ \
  • | * | 6356190d0 (refs/pull/5396/head) fix gfortran link path in dynamic_arch.yml by abhishek-fujitsu 2025-07-28 14:37:29 +0530
  • | | | * b8e6dafc5 (refs/pull/5405/head) temporary change to host-specific build by Martin Kroeker 2025-07-27 10:27:27 +0200
  • | | | * 809bd9d3c temporary upgrade to a Graviton4 instance for V2 build testing by Martin Kroeker 2025-07-27 10:24:27 +0200
  • | |_|/
  • |/| |
  • | * | 4c8dcb3a8 Darwin/arm64: disable SVE/SME and fix gfortran link path by abhishek-fujitsu 2025-07-26 16:59:46 +0530
  • * | | 33b50548e Merge pull request #5403 from martin-frbg/issue5402 by Martin Kroeker 2025-07-25 20:10:47 +0200
  • |\ \ \
  • * \ \ \ c504aedca Merge pull request #5400 from Mousius/neoversev2-target by Martin Kroeker 2025-07-25 15:47:06 +0200
  • |\ \ \ \
  • | * | | | b9e107932 (refs/pull/5400/head) add NeoverseV2 by Martin Kroeker 2025-07-25 15:44:34 +0200
  • | * | | | 2f89a5970 fix NeoverseV2 typo by Martin Kroeker 2025-07-25 15:43:37 +0200
  • | | * | | a9e8fa06b (refs/pull/5403/head) Introduce a (crude) threshold to multithreading by Martin Kroeker 2025-07-25 15:15:46 +0200
  • | |/ / /
  • |/| | |
  • * | | | b4c2b34a4 Merge pull request #5401 from martin-frbg/followup-5397 by Martin Kroeker 2025-07-25 13:56:13 +0200
  • |\ \ \ \
  • * \ \ \ \ c9204f7b6 Merge pull request #5399 from Mousius/bgemm-8x4 by Martin Kroeker 2025-07-25 11:20:52 +0200
  • |\ \ \ \ \
  • | | | | | * cc6055270 deploy: a55e65dba9 by martin-frbg 2025-07-25 07:29:18 +0000
  • * | | | | | a55e65dba Merge pull request #5391 from martin-frbg/issue5387 by Martin Kroeker 2025-07-25 09:28:46 +0200
  • |\ \ \ \ \ \
  • | | | | | * | 0bc79da58 add neon header by abhishek-fujitsu 2025-07-25 11:10:20 +0530
  • | | | | | * | 720a4743b update contribution list by abhishek-fujitsu 2025-07-23 17:41:34 +0530
  • | | | | | * | 05fc88180 ARM64: Enable bfloat16 kernels by default by abhishek-fujitsu 2025-05-19 18:34:38 +0530
  • | |_|_|_|/ /
  • |/| | | | |
  • | | | * | | 965463f17 (refs/pull/5401/head) Include float-bfloat conversion functions in ONLY_CBLAS builds as well by Martin Kroeker 2025-07-24 23:33:20 +0200
  • | |_|/ / /
  • |/| | | |
  • * | | | | 4272cf8c7 Merge pull request #5398 from martin-frbg/fixup-5394 by Martin Kroeker 2025-07-24 23:29:39 +0200
  • |\ \ \ \ \
  • | | | | * | 87247daad Add NEOVERSEV2 target support by Chris Sidebottom 2025-07-24 11:30:43 +0000
  • | |_|_|/ /
  • |/| | | |
  • | | | * | ea2faf0c9 (refs/pull/5399/head) Add optimized BGEMM for NEOVERSEN2 target by Chris Sidebottom 2025-07-21 17:09:47 +0000
  • | |_|/ /
  • |/| | |
  • | * | | a5b55f6fe (refs/pull/5398/head) remove CBLAS restriction on GEMM_GEMV forwarding by Martin Kroeker 2025-07-24 09:30:58 +0200
  • | | | * 8f3b46011 deploy: a4f4662459 by martin-frbg 2025-07-24 07:28:54 +0000
  • * | | | a4f466245 Merge pull request #5397 from omegacoleman/fix-cblas-bgemm by Martin Kroeker 2025-07-24 09:28:25 +0200
  • |\ \ \ \
  • | | * | | 82954ba4c Update ?GEMM-to-?GEMV forwarding settings by Martin Kroeker 2025-07-23 23:24:42 +0200
  • | |/ / /
  • |/| | |
  • * | | | 392d38168 Merge pull request #5394 from Mousius/optimize-bgemv by Martin Kroeker 2025-07-23 23:13:44 +0200
  • |\ \ \ \
  • | | * | | 41f9701eb (refs/pull/5397/head) Fix cmake building with cblas_bgemm by youcai 2025-07-23 21:51:30 +0800
  • | |/ / /
  • |/| | |
  • | | | * 341f3e8bc deploy: f4caa61e47 by martin-frbg 2025-07-23 12:50:04 +0000
  • * | | | f4caa61e4 Merge pull request #5395 from martin-frbg/fixloongsonCI by Martin Kroeker 2025-07-23 14:36:30 +0200
  • |\ \ \ \
  • | * | | | 444d03db9 (refs/pull/5395/head) switch to another site that still has libffi6 (for now) by Martin Kroeker 2025-07-23 14:04:11 +0200
  • |/ / / /
  • | | | | * 06ced6da1 (refs/pull/5393/head) Bump xuantie toolchains V3.1.0 for c910v by xctan 2025-07-23 17:36:40 +0800
  • | * | | | 2c3cdaf74 (refs/pull/5394/head) Optimized BGEMV for NEOVERSEV1 target by Chris Sidebottom 2025-07-21 17:09:47 +0000
  • |/ / / /
  • | | | * 6144004e9 Fix xtheadvector compilation by xctan 2025-07-23 16:49:24 +0800
  • | | | * 4a94ef57e Bump xuantie toolchains V3.0.2 for c910v by Han Gao 2025-07-23 01:34:31 +0800
  • | |_|/
  • |/| |
  • | * | 7d908564f (refs/pull/5391/head) Use OpenBLAS_ROOT_DIR in CMake config file generation only if set by Martin Kroeker 2025-07-22 16:01:46 +0200
  • |/ /
  • * | 2f81d6e60 Merge pull request #5390 from martin-frbg/issue5388-2 by Martin Kroeker 2025-07-22 13:05:14 +0200
  • |\ \
  • | * | e2d941e9a (refs/pull/5390/head) Declare the "small" kernel static in addition to inline by Martin Kroeker 2025-07-22 11:02:32 +0200
  • | * | 821470093 Declare the "small" kernel static in addition to inline by Martin Kroeker 2025-07-22 11:01:37 +0200
  • |/ /
  • | * ff4f949a7 deploy: 4ae8707b54 by martin-frbg 2025-07-22 08:58:28 +0000
  • * | 4ae8707b5 Merge pull request #5389 from martin-frbg/issue5388 by Martin Kroeker 2025-07-22 10:57:59 +0200
  • |\ \
  • | * | b24212f5d (refs/pull/5389/head) fix numbers by Martin Kroeker 2025-07-21 22:54:52 +0200
  • | * | 6ff06f548 Add cross-compilation data for RISCV64 targets by Martin Kroeker 2025-07-21 22:42:15 +0200
  • |/ /
  • | | * 2049628f2 (refs/pull/5318/head) Enable lapack+OpenMP on MinGW-w64. by مهدي شينون (Mehdi Chinoune) 2025-06-19 12:48:11 +0100
  • | | * 14f74d2bb Don't rename symbols on MinGW-w64 by مهدي شينون (Mehdi Chinoune) 2025-06-19 12:47:09 +0100
  • | |/
  • |/|
  • | | * b537c1be4 (refs/pull/5187/head) Add files via upload by Martin Kroeker 2025-07-20 09:23:33 +0200
  • | | * b7e55475a fix checks by Martin Kroeker 2025-07-20 08:45:39 +0200
  • | | * c59a6194b Merge branch 'OpenMathLib:develop' into gemmt_tests by Martin Kroeker 2025-07-19 13:21:18 +0200
  • | | |\
  • | |_|/
  • |/| |
  • | | * d8b3bdf7a cleanup by Martin Kroeker 2025-07-19 13:20:47 +0200
  • | * | ff16fb4f7 deploy: d92f151634 by martin-frbg 2025-07-19 06:47:16 +0000
  • * | | d92f15163 Merge pull request #5386 from martin-frbg/issue5384 by Martin Kroeker 2025-07-19 08:33:51 +0200
  • |\ \ \
  • | * | | 30dbca505 (refs/pull/5386/head) fix misleading indentation to silence a gcc warning by Martin Kroeker 2025-07-18 23:51:04 +0200
  • | * | | 38e699929 format cleanup by Martin Kroeker 2025-07-18 23:45:08 +0200
  • | * | | 3df503caf portability fix and cleanup by Martin Kroeker 2025-07-18 23:41:57 +0200
  • |/ / /
  • | | * 4e0cf1ecc Merge branch 'OpenMathLib:develop' into gemmt_tests by Martin Kroeker 2025-07-18 23:26:06 +0200
  • | | |\
  • | |_|/
  • |/| |
  • | | * d7d6e6b53 Adjust tests to conform to the behavior now codified by the Reference BLAS by Martin Kroeker 2025-07-18 23:25:56 +0200
  • | * | b5f1223a4 deploy: 39c90f9859 by martin-frbg 2025-07-18 21:24:14 +0000
  • * | | 39c90f985 Merge pull request #5380 from quic/topic/sgemm_direct_sme1_alpha_beta by Martin Kroeker 2025-07-18 23:23:39 +0200
  • |\ \ \
  • | * | | eae0abfdb (refs/pull/5380/head) SME1 based direct kernel with alpha and beta for cblas_sgemm level 3 API. by Rajendra Prasad Matcha 2025-07-11 14:51:16 +0530
  • | | * | 2b8fe330a deploy: ac8cbfdd8e by martin-frbg 2025-07-16 21:22:33 +0000
  • * | | | ac8cbfdd8 Merge pull request #5381 from Mousius/bgemv-infrastructure by Martin Kroeker 2025-07-16 23:22:08 +0200
  • |\ \ \ \
  • | | | * | 287963743 deploy: 08df0f02d9 by martin-frbg 2025-07-15 19:29:59 +0000
  • * | | | | 1742decdc Merge pull request #5375 from lowkeyrossi/CI_for_WoA by Martin Kroeker 2025-07-15 21:16:03 +0200
  • |\ \ \ \ \
  • * \ \ \ \ \ 08df0f02d Merge pull request #5382 from martin-frbg/issue5379 by Martin Kroeker 2025-07-15 21:07:34 +0200
  • |\ \ \ \ \ \
  • | * | | | | | 7d7757acd (refs/pull/5382/head) Update cross-compilation instructions for the Android NDK by Martin Kroeker 2025-07-15 18:25:55 +0200
  • |/ / / / / /
  • | | * | | | 947d7af4c (refs/pull/5381/head) Fix CMake references to bscal and bgemv by Chris Sidebottom 2025-07-15 14:41:19 +0000
  • | | * | | | 72d2ebb4d Re-add GEMV fallback for Level3 by Chris Sidebottom 2025-07-15 15:00:20 +0100
  • | | * | | | e10541146 Add infrastructure for bgemv/bscal by Chris Sidebottom 2025-07-13 15:25:07 +0000
  • * | | | | | 666e1081a Merge pull request #5378 from martin-frbg/cpuid_lunarlake by Martin Kroeker 2025-07-13 23:18:22 +0200
  • |\ \ \ \ \ \
  • | | | | | * | 5a0c2b30d deploy: 3ea6322eff by martin-frbg 2025-07-13 21:04:03 +0000
  • * | | | | | | 3ea6322ef Merge pull request #5377 from Mousius/test-fixes by Martin Kroeker 2025-07-13 23:03:35 +0200
  • |\ \ \ \ \ \ \
  • | | * | | | | | 848e9e6ba (refs/pull/5378/head) Add ID data for Intel Lunar Lake ("Core Ultra 200V series") by Martin Kroeker 2025-07-13 20:34:19 +0200
  • | |/ / / / / /
  • |/| | | | | |
  • | | | * | | | 09a016fdf Split sbgemv test from sbgemm test by Chris Sidebottom 2025-07-13 13:01:27 +0000
  • | | |/ / / /
  • | |/| | | |
  • | * | | | | 3f110c827 (refs/pull/5377/head) Improve bgemm and sbgemm testing by Chris Sidebottom 2025-07-13 12:48:09 +0000
  • |/ / / / /
  • | * | | | cb2c72671 (refs/pull/5375/head) Add CI support for OpenBLAS on WoA by newyork_loki 2025-07-12 14:37:30 +0530
  • | * | | | c8d41e4a3 Add CI support for OpenBLAS on WoA by newyork_loki 2025-07-12 14:34:29 +0530
  • |/ / / /
  • * | | | 81b30d453 Merge pull request #5374 from martin-frbg/fixup-5373 by Martin Kroeker 2025-07-11 15:33:38 +0200
  • |\ \ \ \
  • | * | | | aad97c776 (refs/pull/5374/head) Fix return type declaration by Martin Kroeker 2025-07-11 15:32:41 +0200
  • |/ / / /
  • | | * | f2ee10172 deploy: 7acb122a98 by martin-frbg 2025-07-11 09:57:26 +0000
  • * | | | 7acb122a9 Merge pull request #5373 from Mousius/bgemm-optimized by Martin Kroeker 2025-07-11 11:56:56 +0200
  • |\ \ \ \
  • | * | | | 740efd71c (refs/pull/5373/head) Add optimized BGEMM kernel for NEOVERSEV1 target by Chris Sidebottom 2025-07-10 23:23:27 +0000
  • |/ / / /
  • * | | | e927373f6 Merge pull request #5371 from martin-frbg/fixup-5357 by Martin Kroeker 2025-07-10 16:38:37 +0200
  • |\ \ \ \
  • | * | | | 9a272fece (refs/pull/5371/head) Re-enable the BGEMM tests by Martin Kroeker 2025-07-10 15:02:59 +0200
  • | * | | | b54aec804 remove spurious include by Martin Kroeker 2025-07-10 15:00:30 +0200
  • | * | | | 343830c26 Add BGEMM parameter tables by Martin Kroeker 2025-07-10 14:59:46 +0200