132 Commits (303bdb673b8ef7b9e2ccdbb331827e00a1293951)

Author SHA1 Message Date
  Martin Kroeker 1a6ea8ee6d Merge pull request #2338 from kavanabhat/aix_mod 6 years ago
  Martin Kroeker dd04143d4a Merge pull request #2328 from martin-frbg/ppc9 6 years ago
  Martin Kroeker dedd822d1a Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker 2181fb7047 Fix caxpy/caxpyc naming in localentry 6 years ago
  Martin Kroeker a9b62c03f8 Substitute precompiled gcc7 codes only when gcc is older than 9.x 6 years ago
  Anton Blanchard cf2a8e410c Fix SEGV in cdot_power9 6 years ago
  Martin Kroeker 08fa83aba2 Merge pull request #2312 from martin-frbg/power8be 6 years ago
  Martin Kroeker cad0d150db Define alternate kernels for big-endian POWER8 6 years ago
  Martin Kroeker eba0aeb7cd Fix compilation for big-endian POWER8 6 years ago
  Martin Kroeker 0c07c356c1 Define alternate kernels for big-endian PPC440 6 years ago
  Martin Kroeker b3ac6ee222 Define alternate kernels for big-endian PPC970 6 years ago
  Martin Kroeker 68597002ea The assembly microkernel is not safe to use on ELFv1 6 years ago
  Martin Kroeker d2a6285549 The assembly microkernel is not safe to use on ELFv1 6 years ago
  Martin Kroeker d999688d1a The assembly microkernel is not safe to use on ELFv1 6 years ago
  Martin Kroeker 928fe1b28e The assembly microkernel is not safe to use on ELFv1 6 years ago
  Martin Kroeker 5e244d80f2 Merge pull request #2271 from quickwritereader/strmm_fix 6 years ago
  AbdelRauf ede5efebab trmm fix 6 years ago
  Martin Kroeker 596a22325a Fix prologue of power9 assembly cdot(c) kernel to provide cdotc 6 years ago
  Martin Kroeker 7f58f3ad0e Fix mis-edits in the gcc-derived power8 caxpy kernel 6 years ago
  Martin Kroeker 673e5a0495 Replace several POWER8/9 C kernels with their gcc7-generated assembly versions (#2263) 6 years ago
  Martin Kroeker f3c314550c Merge pull request #2243 from quickwritereader/develop 6 years ago
  AbdelRauf 847c20c9b7 fix uninitialized variables i 6 years ago
  AbdelRauf 4c22828812 caxpy and cdot are using vec_vsx_ld 6 years ago
  AbdelRauf e79712d969 cgemv using vec_vsx_ld instead of letting gcc to decide 6 years ago
  AbdelRauf be09551cdf aligned 6 years ago
  Kavana Bhat 3dc6b26eff AIX changes for Power8 6 years ago
  Martin Kroeker 6b6c9b1441 Merge pull request #2172 from quickwritereader/develop 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  Piotr Kubaj eebfeba768 Fix build on FreeBSD/powerpc64. 6 years ago
  kavanabhat a575f1e4c7 Update dtrmm_kernel_16x4_power8.S 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  Martin Kroeker a17cf36225 Merge pull request #2153 from quickwritereader/develop 6 years ago
  AbdelRauf 148c4cc5fd conflict resolve 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago
  AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 6 years ago
  AbdelRauf 8fe794f059 improved zgemm power9 based on power8 6 years ago
  Martin Kroeker 3f427c0cf9 Merge pull request #2107 from quickwritereader/develop 6 years ago
  AbdelRauf 47f892198c conflict resolve 6 years ago
  AbdelRauf 628b335e83 Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop 6 years ago
  AbdelRauf 0f105dd8a5 sgemm/strmm 6 years ago
  Martin Kroeker ccfb7ead15 Merge pull request #2072 from martin-frbg/sum 6 years ago
  Rashmica Gupta bcdf1d4917 Add in runtime CPU detection for POWER. 6 years ago
  Martin Kroeker 706dfe263b Add POWER implementation of ?sum 6 years ago
  Martin Kroeker 7c51cc8527 Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 718efcec6f Fix out-of-bounds memory access in gemm_beta 7 years ago
  Martin Kroeker f9d67bb5e8 Fix out-of-bounds memory access in gemm_beta 7 years ago
  Ubuntu 498ac98581 Note for unused kernels 7 years ago
  Ubuntu cd9ea45463 NBMAX=4096 for gemvn, added sgemvn 8x8 for future 7 years ago
  Ubuntu 4abc375a91 sgemv cgemv pairs 7 years ago