18 Commits (45fdf951b64aa9145996727ecda901f00a2eda3c)

Author SHA1 Message Date
  Rajalakshmi Srinivasaraghavan 55bb9f639a POWER10: Optimized zgemv 4 years ago
  Martin Kroeker 86c5a0013f Add workaround for LAPACK testsuite failures with the NVIDIA HPC compiler 4 years ago
  Rajalakshmi Srinivasaraghavan eff7c9166e Optimize cdot function for POWER10 5 years ago
  Gordon Fossum 213c0e7abb Added special unrolled vectorized versions of "Solve" for specific sizes, 5 years ago
  Rajalakshmi Srinivasaraghavan 6e364981a8 Optimize sdot/ddot for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan dd7a9cc5bf POWER10: Change dgemm unroll factors 5 years ago
  Rajalakshmi Srinivasaraghavan b435491885 Optimize caxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan c24ba8b1dd Optimize saxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan ad745c0bae Optimize scopy/ccopy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 0826d68f93 POWER10: Change the packing format for bfloat16 5 years ago
  Martin Kroeker 2061f7fdff Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Rajalakshmi Srinivasaraghavan 2df4235e00 Optimize dcopy/zcopy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan be43d2cb96 Optimize daxpy/zaxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan f77b6a83f4 dgemv optimization for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Gordon Fossum bb2f52844b powerpc: Optimized ZGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 571eadb880 powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago