Commit Graph

  • fd3afef12 (refs/pull/5218/head) lapacke_mangling.h is no longer generated, so don't delete on make clean by Martin Kroeker 2025-04-10 22:09:19 +0200
  • b30dc9701 Merge pull request #5215 from annop-w/gemv_t by Martin Kroeker 2025-04-10 13:06:07 -0700
  • 2893d0add Merge pull request #5211 from guoyuanplct/develop by Martin Kroeker 2025-04-10 09:43:03 -0700
  • f409ee030 deploy: ed1e470663 by martin-frbg 2025-04-10 15:20:15 +0000
  • ed1e47066 Merge pull request #5217 from haampie/hs/fix/darwin-gcc by Martin Kroeker 2025-04-10 08:19:46 -0700
  • 3d6d026fe (refs/pull/5217/head) no-gcse when loongarch64 by Harmen Stoppels 2025-04-10 15:44:31 +0200
  • 51ba70f47 test_potrs.c: remove pragma darwin-aarch64 support by Harmen Stoppels 2025-04-10 15:20:34 +0200
  • ec146157d (refs/pull/5215/head) Use SVE kernel for S/DGEMVT for SVE machines by Annop Wongwathanarat 2025-04-02 09:11:58 +0000
  • de2380e5a Merge pull request #5214 from martin-frbg/issue5200 by Martin Kroeker 2025-04-09 10:37:52 -0700
  • a34b487f2 (refs/pull/5214/head) Remove spurious cast from Alpha and Cell's DEFAULT_ALIGN by Martin Kroeker 2025-04-09 17:25:46 +0200
  • 1b3e7cc49 Merge pull request #5212 from martin-frbg/lapack1119 by Martin Kroeker 2025-04-09 04:37:14 -0700
  • 9056b811f deploy: 4270d5bc43 by martin-frbg 2025-04-09 08:55:54 +0000
  • 4270d5bc4 Merge pull request #5204 from martin-frbg/issue4692 by Martin Kroeker 2025-04-09 01:48:31 -0700
  • 880e43ee5 Merge pull request #5198 from martin-frbg/woadlldebug by Martin Kroeker 2025-04-08 14:22:51 -0700
  • 70865a894 Merge pull request #5180 from ywwry66/openmp_use_cmake by Martin Kroeker 2025-04-08 13:16:07 -0700
  • 77b14e067 deploy: f0f274725d by martin-frbg 2025-04-08 14:30:30 +0000
  • f0f274725 Merge pull request #5207 from martin-frbg/issue5202 by Martin Kroeker 2025-04-08 07:14:16 -0700
  • 94fb7033a (refs/pull/5212/head) Fix incomplete error message (Reference-LAPACK PR 1119) by Martin Kroeker 2025-04-08 07:03:11 -0700
  • 1ff303f36 (refs/pull/5211/head) Optimizing the Implementation of GEMV on the RISC-V V Extension by lglglglgy 2025-04-08 21:18:00 +0800
  • fc8090b60 (refs/pull/5207/head) Move additional omp dependency to EXTRALIB by Martin Kroeker 2025-04-08 11:54:36 +0200
  • 1c5d0d553 move libomp to extralib by Martin Kroeker 2025-04-08 10:44:36 +0200
  • 198319a24 deploy: 67c5bdd639 by martin-frbg 2025-04-07 19:24:10 +0000
  • 67c5bdd63 Azure CI: Update flang call in OSX_LLVM_flangnew job (#5208) by Martin Kroeker 2025-04-07 12:20:43 -0700
  • eebedaeeb (refs/pull/5208/head) Update azure-pipelines.yml by Martin Kroeker 2025-04-07 21:19:54 +0200
  • 83fc81fe5 Update azure-pipelines.yml by Martin Kroeker 2025-04-07 17:49:03 +0200
  • cee76bf59 Update azure-pipelines.yml by Martin Kroeker 2025-04-07 16:23:28 +0200
  • 8a172486c Update azure-pipelines.yml by Martin Kroeker 2025-04-07 15:34:42 +0200
  • 96795e0f4 Update azure-pipelines.yml by Martin Kroeker 2025-04-07 15:24:20 +0200
  • 02c6bce8e Update flang call in OSX_LLVM_flangnew job by Martin Kroeker 2025-04-07 14:47:41 +0200
  • 1ed962d25 Fix compilation with xcode16.3/clang17/gcc14 by Martin Kroeker 2025-04-06 10:44:48 -0700
  • 7e0af691b deploy: f0008f50cc by martin-frbg 2025-04-05 21:44:00 +0000
  • f0008f50c Merge pull request #5206 from ColumbusAI/develop by Martin Kroeker 2025-04-05 23:43:31 +0200
  • 7bf848454 (refs/pull/5206/head) Update zsum.c -- fixed spelling error to successfully compile by ColumbusAI 2025-04-05 09:57:53 -0700
  • 0aa5ef29e (refs/pull/5204/head) Repeat the libs target's "ln" in the all target to ensure completeness by Martin Kroeker 2025-04-03 23:54:56 +0200
  • 0be743d5b deploy: f90eff306d by martin-frbg 2025-04-03 16:11:09 +0000
  • f90eff306 Merge pull request #5197 from e4t/z-arch-exec-stack by Martin Kroeker 2025-04-03 18:10:41 +0200
  • 04915be82 (refs/pull/5203/head) Add vector registers to clobber list to prevent compiler optimization. by Vaisakh K V 2025-04-03 12:18:43 +0530
  • 3fc15ad81 (refs/pull/5198/head) Fix pdb file creation in debug dll builds with CMake on Windows/WoA by Martin Kroeker 2025-03-30 23:22:09 +0200
  • 61b9339d3 (refs/pull/5197/head) getarch/cpuid.S: Fix warning about executable stack by Egbert Eich 2025-03-28 08:59:26 +0100
  • ea6515c4b On zarch don't produce objects from assembler with a writable stack section by Egbert Eich 2025-03-26 17:35:21 +0100
  • dd77d3844 deploy: f33943d73e by martin-frbg 2025-03-27 08:18:23 +0000
  • f33943d73 Merge pull request #5196 from martin-frbg/issue5193 by Martin Kroeker 2025-03-27 09:17:54 +0100
  • 251c3f857 (refs/pull/5180/head) gh m1: fix mixed linkage when built with OpenMP and clang+gfortran by Ruiyang Wu 2025-03-26 23:19:40 -0400
  • 1b0c0f00e CMake: Avoid mixed OpenMP linkage by Ruiyang Wu 2025-03-13 02:25:52 -0400
  • 02fd1df10 CMake: Pass `OpenMP` compiler and linker flags through CMake targets by Ruiyang Wu 2025-03-12 20:41:55 -0400
  • 8b3553420 Merge pull request #5195 from martin-frbg/update-gensymbolpl by Martin Kroeker 2025-03-26 23:39:53 +0100
  • 51c1fb1f9 (refs/pull/5196/head) Fix ?spmv build and misinterpretation of NO_LAPACK=0 by Martin Kroeker 2025-03-26 23:36:49 +0100
  • 3ca1ba1be (refs/pull/5195/head) resynchronize with the posix shell version by Martin Kroeker 2025-03-26 18:37:11 +0100
  • dc9bb4fd8 (refs/pull/5194/head) FIX by hezhiqiang 2025-03-27 01:11:44 +0800
  • b46138073 deploy: 72f0abeed5 by martin-frbg 2025-03-26 10:22:33 +0000
  • 72f0abeed Merge pull request #5191 from Harishmcw/CMake_Symbol_Fix by Martin Kroeker 2025-03-26 11:22:07 +0100
  • 1724b3f10 (refs/pull/5191/head) DLL symbol pre/postfixing in CMake builds by Harishmcw 2025-03-26 10:55:50 +0530
  • c2e7ab535 DLL symbol pre/postfixing in CMake builds by Harishmcw 2025-03-26 10:50:29 +0530
  • 200771078 Merge pull request #5190 from Harishmcw/develop by Martin Kroeker 2025-03-25 22:32:43 +0100
  • 360a0c3dd deploy: 4e3afa7beb by martin-frbg 2025-03-25 21:03:19 +0000
  • 4e3afa7be Merge pull request #5175 from shubhamsvc/dgemv_thread_throttling by Martin Kroeker 2025-03-25 22:02:48 +0100
  • c0a5c9655 (refs/pull/5190/head) Fix missing commas in gensymbol.pl by Harishmcw 2025-03-24 13:49:55 +0530
  • 030bfd1b3 Remove unused and conflicting declarations from the f2c preamble by Martin Kroeker 2025-03-21 09:21:16 +0100
  • 140da0c8f Fix f2c conversion errors by Martin Kroeker 2025-03-20 22:27:05 +0100
  • cf4c5a6d8 Update f2c-translated stand-ins to include GEMMTR by Martin Kroeker 2025-03-20 20:20:41 +0100
  • d1d3342fe Restore OpenBLAS version of header and add GEMMTR by Martin Kroeker 2025-03-20 15:44:59 +0100
  • 9fe2784b0 Delete non-applicable header entries from Reference-LAPACK by Martin Kroeker 2025-03-20 11:44:10 +0100
  • 53e8e569a (refs/pull/5188/head) Fix missing quotes around variables that might be empty by Martin Kroeker 2025-03-20 11:12:21 +0100
  • a9d24e6cb Fix source files for gemmtr and sbgemmt by Martin Kroeker 2025-03-20 11:10:55 +0100
  • 40e1e58e9 Fix DLL symbol name pre/postfixing on Windows by Martin Kroeker 2025-03-19 22:52:41 +0100
  • 088f3b435 Update CBLAS3 tests from Reference-LAPACK to add GEMMT(R) testing by Martin Kroeker 2025-03-19 22:41:20 +0100
  • cfb7685a7 Add cblas_?gemmtr aliases of cblas_?gemmt by Martin Kroeker 2025-03-19 22:36:56 +0100
  • 8e289ecdd (refs/pull/5175/head) Simplified thread throttling function in gemv by shubham.chaudhari 2025-03-18 13:24:05 +0530
  • 189dbbc04 Add thread throttling for dynamic arch neoversev1 by shubham.chaudhari 2025-03-04 16:08:55 +0530
  • b6cb5ece5 Add thread throttling profile for DGEMV on NEOVERSEV1 by shubham.chaudhari 2025-02-28 13:10:40 +0530
  • 64a6bc16b deploy: 51c244a098 by martin-frbg 2025-03-15 16:25:59 +0000
  • 51c244a09 Merge pull request #5184 from taoye9/fix_sbgemv_n_bug by Martin Kroeker 2025-03-15 17:25:33 +0100
  • f27ba5efd (refs/pull/5184/head) fix bugs in aarch64 sbgemv_n kernel by Ye Tao 2025-03-14 17:55:40 +0000
  • e9fbe0a83 Merge pull request #5183 from annop-w/fix_sbgemv_t by Martin Kroeker 2025-03-13 23:04:09 +0100
  • edef2e444 (refs/pull/5183/head) Fix bug in ARM64 sbgemv_t by Annop Wongwathanarat 2025-03-13 20:55:31 +0000
  • b55ca71d5 Merge pull request #5182 from annop-w/sgemm_ncopy by Martin Kroeker 2025-03-13 16:04:39 +0100
  • 2f778554b Merge pull request #5181 from taoye9/change_sbgemn_cast_bf16 by Martin Kroeker 2025-03-13 13:50:26 +0100
  • acd9975d6 deploy: 66e0f1e621 by martin-frbg 2025-03-13 11:05:23 +0000
  • 66e0f1e62 Merge pull request #5178 from martin-frbg/lapack_cplx_dummy by Martin Kroeker 2025-03-13 11:57:29 +0100
  • 9807f5658 (refs/pull/5182/head) Optimize aarch64 sgemm_ncopy by Annop Wongwathanarat 2025-03-12 21:26:27 +0000
  • 43f413748 deploy: 1ba02656e6 by martin-frbg 2025-03-13 06:34:21 +0000
  • 1ba02656e Merge pull request #5177 from martin-frbg/cmakelapacke by Martin Kroeker 2025-03-13 07:33:52 +0100
  • 8a418b1aa (refs/pull/5178/head) Add dummy implementations for the LAPACK_COMPLEX_CUSTOM case by Martin Kroeker 2025-03-12 23:20:16 +0100
  • b34235ca6 (refs/pull/5177/head) Fix inclusion of deprecated interfaces and cgesvdq/strsyl3 by Martin Kroeker 2025-03-12 22:41:50 +0100
  • 37b854769 Merge pull request #5173 from nakagawa-fj/gemm_load_imbalance by Martin Kroeker 2025-03-12 22:38:02 +0100
  • a3e7b1607 Merge pull request #5157 from manaalmj/feature by Martin Kroeker 2025-03-12 21:08:23 +0100
  • 67156a641 deploy: 8865850496 by martin-frbg 2025-03-12 17:50:21 +0000
  • 886585049 Merge pull request #5176 from annop-w/fix_sbgemv_t by Martin Kroeker 2025-03-12 18:49:54 +0100
  • 4c00099ed (refs/pull/5181/head) replace customize bf16_to_fp32 with arm neon vcvtah_f32_bf16 by Ye Tao 2025-03-12 16:20:15 +0000
  • a085b6c9e (refs/pull/5176/head) Fix aarch64 sbgemv_t compilation error for GCC < 13 by Annop Wongwathanarat 2025-03-12 14:49:10 +0000
  • 80d3c2ad9 (refs/pull/5173/head) Add Improving Load Imbalance in Thread-Parallel GEMM by Masato Nakagawa 2025-03-11 20:18:20 +0900
  • 5c4e38ab1 (refs/pull/5157/head) Optimize gemv_n_sve kernel by manjam01 2025-02-27 09:39:06 +0000
  • 27adf9f80 deploy: 39eb43d441 by martin-frbg 2025-03-07 12:58:24 +0000
  • 39eb43d44 Improve thread safety of pthreads builds that rely on C11 atomic operations for locking (#5170) by Martin Kroeker 2025-03-07 13:48:28 +0100
  • 3a3318006 (refs/pull/5170/head) Use atomic acquire on load, release on store by Martin Kroeker 2025-03-07 10:31:33 +0100
  • 6610db4eb switch to full ACQ_REL semantics by Martin Kroeker 2025-03-04 22:37:51 +0100
  • 98206dbdb Tighten memory orders for C11 atomic operations by Martin Kroeker 2025-03-04 20:04:22 +0100
  • 8c65ea4ed deploy: 1d5ed5c46b by martin-frbg 2025-03-04 15:39:58 +0000
  • 1d5ed5c46 Merge pull request #5168 from taoye9/add_sbgemvn_on_neonversen2 by Martin Kroeker 2025-03-04 16:39:22 +0100
  • 7338a473a Merge pull request #5150 from Harishmcw/WoA-Experiments by Martin Kroeker 2025-03-03 21:45:53 +0100