9103 Commits (ef9e3f71595edfc69aadeb34d99a96d5f72a29a2)
 

Author SHA1 Message Date
  Martin Kroeker ef9e3f7159 Merge pull request #5149 from martin-frbg/fixup5077-5088 1 year ago
  Martin Kroeker 09ba099461 make throttling code conditional on SMP 1 year ago
  Martin Kroeker 1533fe49be Merge pull request #5144 from taoye9/dispatch_neoversve2_to_neoversven2 1 year ago
  Martin Kroeker c03a81b927 Merge pull request #5141 from michalowski-arm/fork-throttle 1 year ago
  Martin Kroeker 643966d9c7 Merge pull request #5146 from martin-frbg/issue5123 1 year ago
  Martin Kroeker 77fba0f400 Fix "dummy2" flag handling 1 year ago
  Ye Tao f0bea79a6e dispatch NEOVERSEV2 to NEOVERSEN2 under dynamic setting 1 year ago
  Martin Kroeker 20d1118865 Merge pull request #5143 from martin-frbg/issue5111 1 year ago
  Martin Kroeker 75b958a018 Transform the B array back if necessary before returning 1 year ago
  Marek Michalowski 650a062e19 Add thread throttling profile for SGEMV on `NEOVERSEV2` 1 year ago
  Marek Michalowski b723c1b7b7 Add thread throttling profile for SGEMM on `NEOVERSEV2` 1 year ago
  Martin Kroeker ceb8f1e34b Merge pull request #5140 from martin-frbg/issue5139 1 year ago
  Martin Kroeker f1fa370579 fix missing endif 1 year ago
  Martin Kroeker 6d1444be3a Add ARM64 options for NVIDIA HPC 1 year ago
  Martin Kroeker eb84aac7ad Merge pull request #5084 from quic/topic/sgemm_direct_sme1 1 year ago
  Martin Kroeker abbd78aa59 Merge pull request #5138 from martin-frbg/issue5131 1 year ago
  Martin Kroeker ebcab90976 Handle flang-new runtime library linking on Linux like classic-flang 1 year ago
  Martin Kroeker ed1584666c Merge pull request #5137 from martin-frbg/issue5136 1 year ago
  Martin Kroeker b9ae246f20 define USE_TRMM for RISCV64 targets as well 1 year ago
  Martin Kroeker 86cf9d8a2e Merge pull request #5133 from OpenMathLib/revert-4920-issue4917 1 year ago
  Martin Kroeker 0b3c56968d Merge pull request #5135 from martin-frbg/ghwf-n2 1 year ago
  Martin Kroeker c1bb90a823 remove the express NeoverseN2 target from the Cobalt100 job 1 year ago
  Martin Kroeker 77c638db67 Revert "Fix potential inaccuracy in multithreaded level3 related to SWITCH_RATIO" 1 year ago
  Vaisakh K V f66ca05b31 Merge branch 'develop' into topic/sgemm_direct_sme1 1 year ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 1 year ago
  Martin Kroeker a64b75a2e0 Merge pull request #5127 from Harishmcw/gesv-threshold 1 year ago
  Martin Kroeker 453efbd103 Merge pull request #5128 from martin-frbg/issue5120 1 year ago
  Martin Kroeker 877d5a5be6 Add -O2 to flang flags when building on WoA in Release mode 1 year ago
  Martin Kroeker 8d487ef6eb Merge pull request #5124 from XiWeiGu/LoongArch64-LA264-lapack-fixed 1 year ago
  Harish-Gits daf16b8229 Adjusted GESV threading logic for optimal performance on WoA 1 year ago
  Martin Kroeker e8b11a126b Merge pull request #5125 from martin-frbg/issue5122 1 year ago
  Martin Kroeker 9a3948df82 Merge pull request #5126 from martin-frbg/cirrusbsd4 1 year ago
  Martin Kroeker 7f1f776f58 Update FreeBSD jobs to 14.2 1 year ago
  Martin Kroeker 81eed868b6 Restore the non-vectorized code from before PR4880 for POWER8 1 year ago
  Martin Kroeker 98b5ef929c Restore the non-vectorized code from before PR4880 for POWER8 1 year ago
  gxw 2c4a5cc6e6 LoongArch64: Fixed snrm2_lsx.S and cnrm2_lsx.S 1 year ago
  gxw 9e75d6b3d1 LoongArch64: Fixed swap_lsx.S 1 year ago
  gxw e8c740368c LoongArch64: Fixed rot_lsx.S and crot_lsx.S 1 year ago
  Hao Chen c2212d0abd LoongArch64: Fixed copy_lsx.S 1 year ago
  Hao Chen 7f1ebc7ae6 LoongArch64: Fixed iamax_lsx.S 1 year ago
  Hao Chen 31d326f895 LoongArch64: Fixed dot_lsx.S 1 year ago
  Hao Chen 5d6356bc16 LoongArch64: Fixed amax_lsx.S 1 year ago
  Martin Kroeker f42ce7067f Merge pull request #5116 from martin-frbg/issue5110 1 year ago
  Martin Kroeker 7478c10268 Merge branch 'OpenMathLib:develop' into issue5110 1 year ago
  Martin Kroeker c54f5417cc Merge pull request #5118 from martin-frbg/zrot_utestext 1 year ago
  Martin Kroeker 57208b8bce Disable tests with incx,incy=0 (undefined behavior) 1 year ago
  Martin Kroeker 3a4a9b21eb Disable tests with incx,incy=0 (undefined behavior) 1 year ago
  Martin Kroeker 60d0be0e97 Update nrm2.c 1 year ago
  Martin Kroeker 0fd5448b2c Handle INCX=0 1 year ago
  Martin Kroeker 1b85b6a396 Merge pull request #5108 from taoye9/sbgemm_neoversev1 1 year ago