9438 Commits (3ea6322eff62e05c77931f0c2bcce329ca5c5cf2)

Author SHA1 Message Date
  Martin Kroeker 3ea6322eff Merge pull request #5377 from Mousius/test-fixes 11 months ago
  Chris Sidebottom 3f110c8272 Improve bgemm and sbgemm testing 11 months ago
  Martin Kroeker 81b30d4538 Merge pull request #5374 from martin-frbg/fixup-5373 11 months ago
  Martin Kroeker aad97c7763 Fix return type declaration 11 months ago
  Martin Kroeker 7acb122a98 Merge pull request #5373 from Mousius/bgemm-optimized 11 months ago
  Chris Sidebottom 740efd71c4 Add optimized BGEMM kernel for NEOVERSEV1 target 11 months ago
  Martin Kroeker e927373f62 Merge pull request #5371 from martin-frbg/fixup-5357 1 year ago
  Martin Kroeker 9a272fece6 Re-enable the BGEMM tests 1 year ago
  Martin Kroeker b54aec804e remove spurious include 1 year ago
  Martin Kroeker 343830c26f Add BGEMM parameter tables 1 year ago
  Martin Kroeker b37516add6 Add BGEMM parameters 1 year ago
  Martin Kroeker d030f81380 Merge pull request #5369 from martin-frbg/lapack1144 1 year ago
  Martin Kroeker b746f0eda3 Allocate IWORK to hold at least the one element for workspace queries 1 year ago
  Martin Kroeker b8f66ba0ee Merge pull request #5367 from Mousius/bgemm-init 1 year ago
  Martin Kroeker cdebb4fd4b Merge pull request #5365 from martin-frbg/issue5324 1 year ago
  Martin Kroeker ff614575c9 Fix arm64 HAVE_SME setting for DYNAMIC_ARCH builds 1 year ago
  Martin Kroeker 0e11537cab Merge pull request #5357 from Mousius/bgemm-init 1 year ago
  Chris Sidebottom 8cd4be8d47 Temporarily disable test_bgemm 1 year ago
  Chris Sidebottom 66d9185ebe Fix CMake support 1 year ago
  Martin Kroeker 98aefb70b4 Merge pull request #5292 from isharif168/optimized_gemv_n_1x3 1 year ago
  Martin Kroeker fd37406817 Merge branch 'develop' into optimized_gemv_n_1x3 1 year ago
  Chris Sidebottom 48394384ef Use correct constants for per-target BGEMM/SBGEMM 1 year ago
  Chris Sidebottom 73bf0b941a Add bgemm to gensymbol 1 year ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 1 year ago
  Martin Kroeker 15d6e58510 Merge pull request #5364 from martin-frbg/blashalf 1 year ago
  Martin Kroeker 04bb5acd79 change BLAS_HALF to BLAS_BFLOAT16 (another missed rename) 1 year ago
  Martin Kroeker 3d31887073 Merge pull request #5362 from Mousius/fix-bf16 1 year ago
  Martin Kroeker 0ddf8ebd42 Merge pull request #5354 from pratiklp00/p11 1 year ago
  Martin Kroeker d2ea9bbb6d Merge pull request #5363 from guoyuanplct/develop 1 year ago
  guoyuanplct 4ff549a450 Update CONTRIBUTORS.md 1 year ago
  guoyuanplct 309c48e327 Update CONTRIBUTORS.md 1 year ago
  Chris Sidebottom 552e1c7a7a Correct compiler flags for NEOVERSEV1 target 1 year ago
  Chris Sidebottom 46b9b7a080 Also enable BFLOAT16 for make cirun 1 year ago
  Chris Sidebottom eaaa628af2 Enable BUILD_BFLOAT16 in cirun 1 year ago
  Chris Sidebottom 7a97c4ca97 Rename HALF -> BFLOAT16 in some more places 1 year ago
  Martin Kroeker ee6560c89f Merge pull request #5360 from sertonix/cpuid-arm 1 year ago
  Sertonix 8d11e4630c Fix cpuid.S on arm 1 year ago
  Martin Kroeker 03a4afcf14 Merge pull request #5359 from martin-frbg/gitign_isnan 1 year ago
  Martin Kroeker 901de8f33a remove lapacke_mangling.h and add la_xisnan.mod 1 year ago
  Martin Kroeker ce6991780a Merge pull request #5356 from ilina-linaro/ilina-woa 1 year ago
  Martin Kroeker df013c5e28 Merge pull request #5358 from iha-taisei/dot_unroll 1 year ago
  Iha, Taisei f7ad906b49 Performance improvements of [SD]DOT with loop-unrolling on A64FX 1 year ago
  Lina Iyer 7f360001f9 Update README.md to include Windows on Arm64 1 year ago
  Martin Kroeker 36c2589d3a Merge pull request #5355 from tetsuzo-usui/add_parallel_laed3 1 year ago
  Usui, Tetsuzo 14107e37d9 Add parallel laed3 1 year ago
  Martin Kroeker a06bcf836b Merge pull request #5353 from nakagawa-fj/feature/gemm_divide_rate_for_A64FX 1 year ago
  Masato Nakagawa 5253c8f165 Multi-thread Performance Improvement of GEMM with DIVIDE_RATE=1 for 1 year ago
  Martin Kroeker 8f0a1a3f82 Merge pull request #5303 from martin-frbg/issue5289 1 year ago
  Martin Kroeker 2c0dd2468e Merge pull request #5350 from martin-frbg/issue5341 1 year ago
  Martin Kroeker 7ae24d0b85 Merge pull request #5351 from martin-frbg/lapack1140 1 year ago