239 Commits (437c0bf2b4697339d96c7bd0bb0bcdac09eccba1)

Author SHA1 Message Date
  Martin Kroeker 1688c7da43
change line endings from CRLF to LF 3 years ago
  Martin Kroeker 6c118b7977
Fix DNRM2 returning INF instead of zero due to intermediate overflow 3 years ago
  Martin Kroeker c43ec53bdd
Merge pull request #3690 from RajalakshmiSR/cdotp10 3 years ago
  Rajalakshmi Srinivasaraghavan a612e78a97 POWER: Fix complex dot function failures 3 years ago
  Rajalakshmi Srinivasaraghavan 432fd99445 POWER10: dgemv builtin rename 3 years ago
  VFerrari cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0 3 years ago
  Martin Kroeker 9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10 3 years ago
  Rajalakshmi Srinivasaraghavan f191bc652b POWER10: Fix ZGEMM testcase failures 3 years ago
  Rajalakshmi Srinivasaraghavan 8419d538ff POWER10: convert dgemv inline assembly 3 years ago
  Rajalakshmi Srinivasaraghavan b62173c5a0 POWER10: Changing store instructions for Level1 functions 3 years ago
  Martin Kroeker 05dcfa176e
fix undefined prefetchsizes 3 years ago
  Martin Kroeker 2bbb9f05c7
fix undefined prefetchsize 3 years ago
  Rafael Cardoso Fernandes Sousa c78fdcc80d [POWER] Add support for SMALL_MATRIX_OPT 4 years ago
  kavanabhat 9cc95e5657 AIX changes for P10 with GNU Compiler 4 years ago
  kavanabhat fe3c778c51 AIX changes for P10 with GNU Compiler 4 years ago
  Rafael Cardoso Fernandes Sousa b751edf624 Fix unused variable warnings on Power 4 years ago
  Rajalakshmi Srinivasaraghavan b06880c2cd POWER10: Improving dasum performance 4 years ago
  Martin Kroeker c4b464cac6
Merge pull request #3273 from austinpagan/sbgemm_gcc10_fix 4 years ago
  Gordon Fossum e6dd44d989 Power10: Fix for SBGEMM 4 years ago
  Martin Kroeker 2e8ff4a781
Merge pull request #3266 from martin-frbg/powerparam 4 years ago
  Martin Kroeker efdbdd8f82
Add prefetch values for power3 4 years ago
  Martin Kroeker 3906ef3b0f
Add prefetch values for power3 4 years ago
  Martin Kroeker 8adf0971d8
Add prefetch values for power3 4 years ago
  Martin Kroeker 08e2e60762
Add prefetch values for power3 4 years ago
  Martin Kroeker fb9e678235
Fix caxpy/zaxpy for big-endian 4 years ago
  Martin Kroeker dc4fcb48df
Fix inverted conditional for caxpy/zaxpy 4 years ago
  Martin Kroeker 7a48247761
fix c/zrot and sgemv for POWER5 4 years ago
  Rajalakshmi Srinivasaraghavan cbb70438df POWER10: Fixes for sbgemm kernel 4 years ago
  Rajalakshmi Srinivasaraghavan 2379abaa5e POWER10: Improve dgemm performance 4 years ago
  Rajalakshmi Srinivasaraghavan 55bb9f639a POWER10: Optimized zgemv 4 years ago
  Rajalakshmi Srinivasaraghavan 2dbcddd83d POWER10: Adding check for little endian 4 years ago
  Martin Kroeker 86c5a0013f
Add workaround for LAPACK testsuite failures with the NVIDIA HPC compiler 4 years ago
  Martin Kroeker ef85c22474
Add workaround for LAPACK test failures with the NVIDIA HPC compiler 4 years ago
  Martin Kroeker d3555d2e50
Add workaround for LAPACK test failures with the NVIDIA HPC compiler 4 years ago
  Rajalakshmi Srinivasaraghavan 09d47af2c0 Optimize zscal function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 41646ed006 Optimize s/dasum function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 0571c3187b POWER10: Rename mma builtins 4 years ago
  Rajalakshmi Srinivasaraghavan 2056ffc227 Optimize cscal function for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 3ede843d50 Optimize s/dscal function for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 439b93f6d2 Optimize s/drot function for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan eff7c9166e Optimize cdot function for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 601b711c78 Optimize swap function for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 2fb11f873b POWER10: Improve copy performance 5 years ago
  Martin Kroeker 043128cbe5
Merge pull request #3029 from RajalakshmiSR/axpyp10 5 years ago
  Rajalakshmi Srinivasaraghavan 346e30a46a POWER10: Improve axpy performance 5 years ago
  Gordon Fossum 213c0e7abb Added special unrolled vectorized versions of "Solve" for specific sizes, 5 years ago
  Rajalakshmi Srinivasaraghavan 7d46e31de1 POWER10: Optimize dgemv_n 5 years ago
  Rajalakshmi Srinivasaraghavan 6e364981a8 Optimize sdot/ddot for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan dd7a9cc5bf POWER10: Change dgemm unroll factors 5 years ago
  Rajalakshmi Srinivasaraghavan b435491885 Optimize caxpy for POWER10 5 years ago