176 Commits (2df4235e00a73ad61b7997c74497fd86eb278ebf)

Author SHA1 Message Date
  Rajalakshmi Srinivasaraghavan 2df4235e00 Optimize dcopy/zcopy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan be43d2cb96 Optimize daxpy/zaxpy for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 317ff27cda POWER10: Avoid setting accumulators to zero in gemm kernels 5 years ago
  Rajalakshmi Srinivasaraghavan f77b6a83f4 dgemv optimization for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan d557584b71 Fix compilation issues with clang on POWER 5 years ago
  Rajalakshmi Srinivasaraghavan 9be2688c78 Fix to store results in correct order for POWER10 GEMM kernels 5 years ago
  Martin Kroeker 6a2a60038c Merge pull request #2720 from martin-frbg/issue2694 5 years ago
  Martin Kroeker 251a09ec90 Typo fix 5 years ago
  Martin Kroeker 95d37e1575 Regroup the 32 and 64bit sections and restore 64bit CAXPY 5 years ago
  Martin Kroeker 3523bb778e Merge pull request #2721 from martin-frbg/p8align 5 years ago
  Martin Kroeker ca3561cab9 Add ifdefs around call to altivec microkernel 5 years ago
  Martin Kroeker 21072e502a Typo fix 5 years ago
  Martin Kroeker 661c6bfa5a Exclude altivec code paths if the compiler does not support them 5 years ago
  Martin Kroeker 0033f8be0d Use vec_vsx_ld/st to fix misaligned accesses flagged by asan 5 years ago
  Martin Kroeker f308e741b2 remove debug output and revert changes to cdot and crot 5 years ago
  Martin Kroeker f8c2697701 Use POWER6 GEMM, TRMM and DTRSM on 32bit POWER8 5 years ago
  EGuesnet 634e1305f9 Update cgemm_kernel_8x4_power8.S 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Gordon Fossum bb2f52844b powerpc: Optimized ZGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 571eadb880 powerpc: Optimized SGEMM/DGEMM/CGEMM for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Martin Kroeker b1ee81228a Change complex DOT and ROT to generic kernels and switch CGEMM 5 years ago
  Rajalakshmi Srinivasaraghavan bd9ff820bc Fix cmake compilation issue - POWER9 5 years ago
  Martin Kroeker 06208c8d01 Limit this fix to ELFv2 builds 5 years ago
  Martin Kroeker f5c4c28b98 Work around POWER8BE bugs on FreeBSD (ELFv2) 5 years ago
  Rajalakshmi Srinivasaraghavan 2afc074803 Fix DYNAMIC_ARCH build for POWER9 5 years ago
  Martin Kroeker 4f371b0fbf Use POWER8 kernels on big-endian POWER9 for now 5 years ago
  Martin Kroeker 4046985913 Add proper defaults for IxMIN/IxMAX kernels 6 years ago
  Martin Kroeker 0b39cf95b0 Fix endianness conditionals 6 years ago
  Martin Kroeker 9f39f0a2c3 Specify ismin/ismax assembly kernels for POWER8 directly 6 years ago
  Martin Kroeker d483e9270a Update KERNEL.POWER8 6 years ago
  Martin Kroeker 01834aee33 Merge pull request #29 from xianyi/develop 6 years ago
  Martin Kroeker d92bd5be24 Update KERNEL.POWER8 6 years ago
  Martin Kroeker 46e4b12946 Update KERNEL.POWER8 6 years ago
  Martin Kroeker cafdd999b8 Update caxpy_power8.S 6 years ago
  Martin Kroeker 92ca92a46c Update caxpy_power8.S 6 years ago
  Martin Kroeker 486c35c5dc Update icamin_power8.S 6 years ago
  Martin Kroeker 5ba3699f41 Update isamin_power8.S 6 years ago
  Martin Kroeker 8eefa530cd Update isamax_power8.S 6 years ago
  Martin Kroeker de40d47edf Update isamin_power8.S 6 years ago
  Martin Kroeker 7c162b8a21 Update isamax_power8.S 6 years ago
  Martin Kroeker 0544cbc806 Fix syntax of endianness conditional 6 years ago
  Martin Kroeker 120d20731f Fix syntax of endianness conditional 6 years ago
  Martin Kroeker dc345d84df Fix syntax of endianness conditional and add gcc version check for workaround 6 years ago
  Martin Kroeker 1a6ea8ee6d Merge pull request #2338 from kavanabhat/aix_mod 6 years ago
  Martin Kroeker dd04143d4a Merge pull request #2328 from martin-frbg/ppc9 6 years ago
  Martin Kroeker dedd822d1a Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker 2181fb7047 Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker a9b62c03f8 Substitute precompiled gcc7 codes only when gcc is older than 9.x 6 years ago
  Anton Blanchard cf2a8e410c Fix SEGV in cdot_power9 6 years ago