10 Commits (d7ba7679b6a45dafdbdc4d4709ab49e2c0ec54cf)

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Marius Hillenbrand | f91057cbad | s390x: move common vector definitions and utils into header | 5 years ago |
| Marius Hillenbrand | 2ee5b899ce | s390x: enable S/DGEMM block with explicit loop unrolling + interleaving with clang | 5 years ago |
| Marius Hillenbrand | b9b3265ec8 | s390x: avoid inline assembly for vector loads for clang | 5 years ago |
| Marius Hillenbrand | 07c334e7be | s390x: Factor out small block sizes for SGEMM/DGEMM on z14 | 5 years ago |
| Marius Hillenbrand | e2828e30aa | s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving | 5 years ago |
| Marius Hillenbrand | bdd795ed03 | s390x/GEMM: replace 0-init with peeled first iteration | 6 years ago |
| Marius Hillenbrand | 2840432e49 | s390x: improvise vector alignment hints for older compilers | 6 years ago |
| Marius Hillenbrand | 1b0b4349a1 | s390x/Z14: Change register blocking for SGEMM to 16x4 | 6 years ago |
| Marius Hillenbrand | 71b6eaf459 | s390x: Use new sgemm kernel also for strmm on Z14 and newer | 6 years ago |
| Marius Hillenbrand | 43c0d4f312 | s390x: Add vectorized sgemm kernel for Z14 and newer | 6 years ago |