9529 Commits (27304fb29894af8b49815a35d710ac59c8d41ca6)
 

Author SHA1 Message Date
  Mark Ryan 27304fb298 Merge 7fcad02dc2 into c31861ea62 8 months ago
  Martin Kroeker c31861ea62 Merge pull request #5435 from martin-frbg/update_rvv_ci 8 months ago
  Martin Kroeker 57c2936a43 Merge branch 'OpenMathLib:develop' into update_rvv_ci 8 months ago
  Martin Kroeker 6d070820fc Merge pull request #5436 from martin-frbg/update_osx_ci 8 months ago
  Martin Kroeker 1c7251ca20 remove the -llto_library option for any osx fortran compiler 8 months ago
  Martin Kroeker a1331406a3 drop (re)installation of cmake on osx runners 8 months ago
  Martin Kroeker c42fccccb5 Drop installation of cmake 8 months ago
  Martin Kroeker 4c1a4e60a6 Update toolchain to its latest nightly build 8 months ago
  Mark Ryan 7fcad02dc2 fix RVV 1.0 detection code 9 months ago
  Martin Kroeker 06c09deee9 Merge pull request #5423 from hideaki-motoki/issue5417_axpy_sve 9 months ago
  Martin Kroeker da7d0f4a38 Merge pull request #5427 from yuanjia111/develop 9 months ago
  yuanjia c2cc7a3602 riscv64: optimize gemv_t_vector.c 9 months ago
  h-motoki e23f9c6642 Merge remote-tracking branch 'upstream/develop' into issue5417_axpy_sve 9 months ago
  Martin Kroeker b3f247ae5a Merge pull request #5425 from martin-frbg/fixup5389 9 months ago
  h-motoki 855945befb Implementing SVE in [SD]AXPY Kernels for A64FX and Graviton3E 9 months ago
  Martin Kroeker 7c1839899e Increase assumed L2 sizes for RISCV X280 / ZVL256B and for SVE-capable ARM64 9 months ago
  Martin Kroeker 9c43301b6d Merge pull request #5421 from reibax-marcus/develop 9 months ago
  Martin Kroeker 9d6df1dd3e Merge pull request #5422 from ChipKerchner/addRVVVectorizedPacking 9 months ago
  Martin Kroeker f3b2a15fad Merge pull request #5420 from yuanjia111/develop 9 months ago
  Chip Kerchner 64401b4417 Disable vectorized packing for DGEMM - since it is slower than scalar. 9 months ago
  Martin Kroeker 5e43ba948c Merge pull request #5419 from Mousius/bgemm-optimisation 9 months ago
  Chip Kerchner c00afc86a6 Add and use vectorized packing to ZVL128B and ZVL256B. Up to 3x+ faster than generic scalar functions. 9 months ago
  Xabier Marquiegui 3a6b79c50f fix: broken cblas installation when using makefile based builds 9 months ago
  yuanjia 803e8d4838 Move the value assignment of vector x in gemv_n_sve.c to the outermost loop to reduce the repeated data retrieval. 9 months ago
  Chris Sidebottom 5f47b872f1 Remove older kernels for BGEMM on NEOVERSEV1 9 months ago
  Chris Sidebottom 114316f361 Optimize SBGEMM / BGEMM for NEOVERSEV1 further 9 months ago
  Martin Kroeker 75c6ab4036 CI: Update WoA job to use LLVM 20.1.8 and avoid stray preinstalled LLVM19 (#5411) 9 months ago
  Martin Kroeker 5c5f852ee3 Merge pull request #5415 from martin-frbg/Fixum-5399 9 months ago
  Martin Kroeker f1ee61ea30 Include NEON header for the bfloat conversion functions 9 months ago
  Martin Kroeker b3ffd5524a Include NEON header for the bfloat conversion functions 9 months ago
  Martin Kroeker d23680b81d Merge pull request #5407 from nakagawa-fj/feature/gemm_divide_rate_for_neoversev1 10 months ago
  Martin Kroeker b4cc4be2ce Merge pull request #5410 from martin-frbg/issue5404 10 months ago
  Martin Kroeker 0968dddf1a Merge pull request #5409 from martin-frbg/issue5372 10 months ago
  Martin Kroeker eddfe1e6b3 Merge pull request #5408 from ChipKerchner/fixRISCV64GEMVInitializationAndWarnings 10 months ago
  Martin Kroeker 30d11bc92c Adjust multithreading threshold and add an intermediate step 10 months ago
  Martin Kroeker a3b9c933c5 mark xbuffer as volatile to work around gcc15.1 optimizer bug 10 months ago
  Chip Kerchner 72f082f31d Fix bad vector zero initializer and other compiler warnings for RISC-V. 10 months ago
  Masato Nakagawa 7e29f11396 Multi-thread GEMM Performance Improvement on NeoverseV1 (DIVIDE_RATE=1) 10 months ago
  Martin Kroeker 9a64b32b44 Merge pull request #5406 from martin-frbg/fixbgemmtest 10 months ago
  Martin Kroeker b66a01f909 Fix building of bgemm tests on GEMM3M-capable (x86) targets 10 months ago
  Martin Kroeker a5e7c0e3e0 Merge pull request #5396 from abhishek-iitmadras/abhishekk_bfloat16 10 months ago
  abhishek-fujitsu 6356190d06 fix gfortran link path in dynamic_arch.yml 10 months ago
  abhishek-fujitsu 4c8dcb3a8f Darwin/arm64: disable SVE/SME and fix gfortran link path 10 months ago
  Martin Kroeker 33b50548eb Merge pull request #5403 from martin-frbg/issue5402 10 months ago
  Martin Kroeker c504aedca1 Merge pull request #5400 from Mousius/neoversev2-target 10 months ago
  Martin Kroeker b9e107932a add NeoverseV2 10 months ago
  Martin Kroeker 2f89a5970e fix NeoverseV2 typo 10 months ago
  Martin Kroeker a9e8fa06bf Introduce a (crude) threshold to multithreading 10 months ago
  Martin Kroeker b4c2b34a45 Merge pull request #5401 from martin-frbg/followup-5397 10 months ago
  Martin Kroeker c9204f7b6f Merge pull request #5399 from Mousius/bgemm-8x4 10 months ago