21 Commits (b329e45288c2e7fc0ef15c4e8a7b3c8dfd74a930)

Author SHA1 Message Date
  Rafael Cardoso Fernandes Sousa c78fdcc80d [POWER] Add support for SMALL_MATRIX_OPT 4 years ago
  kavanabhat 9cc95e5657 AIX changes for P10 with GNU Compiler 4 years ago
  kavanabhat fe3c778c51 AIX changes for P10 with GNU Compiler 4 years ago
  Rajalakshmi Srinivasaraghavan 55bb9f639a POWER10: Optimized zgemv 4 years ago
  Martin Kroeker 86c5a0013f Add workaround for LAPACK testsuite failures with the NVIDIA HPC compiler 4 years ago
  Rajalakshmi Srinivasaraghavan eff7c9166e Optimize cdot function for POWER10 5 years ago
  Gordon Fossum 213c0e7abb Added special unrolled vectorized versions of "Solve" for specific sizes, 5 years ago
  Rajalakshmi Srinivasaraghavan 6e364981a8 Optimize sdot/ddot for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan dd7a9cc5bf POWER10: Change dgemm unroll factors 5 years ago
  Rajalakshmi Srinivasaraghavan b435491885 Optimize caxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan c24ba8b1dd Optimize saxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan ad745c0bae Optimize scopy/ccopy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 0826d68f93 POWER10: Change the packing format for bfloat16 5 years ago
  Martin Kroeker 2061f7fdff Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Rajalakshmi Srinivasaraghavan 2df4235e00 Optimize dcopy/zcopy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan be43d2cb96 Optimize daxpy/zaxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan f77b6a83f4 dgemv optimization for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Gordon Fossum bb2f52844b powerpc: Optimized ZGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 571eadb880 powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago