9121 Commits (3a3318006c69d6aebfc511e9b5e54c7a7dbdf0e8)
 

Author SHA1 Message Date
  Martin Kroeker 3a3318006c
Use atomic acquire on load, release on store 11 months ago
  Martin Kroeker 6610db4eb4
switch to full ACQ_REL semantics 11 months ago
  Martin Kroeker 98206dbdb9
Tighten memory orders for C11 atomic operations 11 months ago
  Martin Kroeker 1d5ed5c46b
Merge pull request #5168 from taoye9/add_sbgemvn_on_neonversen2 11 months ago
  Martin Kroeker 7338a473a7
Merge pull request #5150 from Harishmcw/WoA-Experiments 11 months ago
  Martin Kroeker 5f200dca54
Merge pull request #5166 from martin-frbg/issue5158 11 months ago
  Martin Kroeker 8b98db13e3
Merge pull request #5167 from taoye9/fix_sbgemv_n_kernel_typo 11 months ago
  Ye Tao 6b8b35cdf2 fix minior issues of redeclaration of float x0,x1 in sbgemv_n_neon.c 11 months ago
  Ye Tao 38ee7c9301 Add dispatch of SBGEMVNKERNEL for NEOVERSEN2 and NEOVERSEV2 11 months ago
  Martin Kroeker 217324d880
Merge pull request #5162 from taoye9/add_sbgemv_tests 11 months ago
  Martin Kroeker e4630ed15a
Merge pull request #5160 from taoye9/sbgemv_n_neon 11 months ago
  Martin Kroeker 35914aa9a2
Expose the option to build without LAPACKE to ccmake 11 months ago
  Martin Kroeker 2b941c44b5
Merge branch 'develop' into sbgemv_n_neon 11 months ago
  Martin Kroeker c797e27a1c
Merge pull request #5159 from annop-w/sbgemv_t_bfdot 11 months ago
  Ye Tao 4346b91559 add beta and alpha testcase for sbgemv 11 months ago
  Ye Tao 35bdbca153 Add sbgemv_n_neon kernel for arm64. 11 months ago
  Annop Wongwathanarat edaf51dd99 Add sbgemv_t_bfdot kernel for ARM64 11 months ago
  Martin Kroeker ef9e3f7159
Merge pull request #5149 from martin-frbg/fixup5077-5088 11 months ago
  Martin Kroeker 09ba099461
make throttling code conditional on SMP 11 months ago
  Harishmcw 030ae1fd97 Redefined threading logic for WoA 11 months ago
  Martin Kroeker 1533fe49be
Merge pull request #5144 from taoye9/dispatch_neoversve2_to_neoversven2 11 months ago
  Martin Kroeker c03a81b927
Merge pull request #5141 from michalowski-arm/fork-throttle 11 months ago
  Martin Kroeker 643966d9c7
Merge pull request #5146 from martin-frbg/issue5123 11 months ago
  Martin Kroeker 77fba0f400
Fix "dummy2" flag handling 11 months ago
  Ye Tao f0bea79a6e dispatch NEOVERSEV2 to NEOVERSEN2 under dynamic setting 11 months ago
  Martin Kroeker 20d1118865
Merge pull request #5143 from martin-frbg/issue5111 11 months ago
  Martin Kroeker 75b958a018
Transform the B array back if necessary before returning 11 months ago
  Marek Michalowski 650a062e19 Add thread throttling profile for SGEMV on `NEOVERSEV2` 11 months ago
  Marek Michalowski b723c1b7b7 Add thread throttling profile for SGEMM on `NEOVERSEV2` 11 months ago
  Martin Kroeker ceb8f1e34b
Merge pull request #5140 from martin-frbg/issue5139 11 months ago
  Martin Kroeker f1fa370579
fix missing endif 11 months ago
  Martin Kroeker 6d1444be3a
Add ARM64 options for NVIDIA HPC 11 months ago
  Martin Kroeker eb84aac7ad
Merge pull request #5084 from quic/topic/sgemm_direct_sme1 11 months ago
  Martin Kroeker abbd78aa59
Merge pull request #5138 from martin-frbg/issue5131 11 months ago
  Martin Kroeker ebcab90976
Handle flang-new runtime library linking on Linux like classic-flang 11 months ago
  Martin Kroeker ed1584666c
Merge pull request #5137 from martin-frbg/issue5136 1 year ago
  Martin Kroeker b9ae246f20
define USE_TRMM for RISCV64 targets as well 1 year ago
  Martin Kroeker 86cf9d8a2e
Merge pull request #5133 from OpenMathLib/revert-4920-issue4917 1 year ago
  Martin Kroeker 0b3c56968d
Merge pull request #5135 from martin-frbg/ghwf-n2 1 year ago
  Martin Kroeker c1bb90a823
remove the express NeoverseN2 target from the Cobalt100 job 1 year ago
  Martin Kroeker 77c638db67
Revert "Fix potential inaccuracy in multithreaded level3 related to SWITCH_RATIO" 1 year ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 1 year ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 1 year ago
  Martin Kroeker a64b75a2e0
Merge pull request #5127 from Harishmcw/gesv-threshold 1 year ago
  Martin Kroeker 453efbd103
Merge pull request #5128 from martin-frbg/issue5120 1 year ago
  Martin Kroeker 877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode 1 year ago
  Martin Kroeker 8d487ef6eb
Merge pull request #5124 from XiWeiGu/LoongArch64-LA264-lapack-fixed 1 year ago
  Harish-Gits daf16b8229 Adjusted GESV threading logic for optimal performance on WoA 1 year ago
  Martin Kroeker e8b11a126b
Merge pull request #5125 from martin-frbg/issue5122 1 year ago
  Martin Kroeker 9a3948df82
Merge pull request #5126 from martin-frbg/cirrusbsd4 1 year ago