102 Commits (16772ed07e8036937e172cc3133cd3fcb1b4cbdb)

Author SHA1 Message Date
  Martin Kroeker 16772ed07e
Update cscal.c 1 year ago
  Martin Kroeker e54f43bb45
Update cscal.c 1 year ago
  Martin Kroeker c7285a1406
Update cscal.c 1 year ago
  Martin Kroeker 184a527165
Update cscal.c 1 year ago
  Martin Kroeker 4f9be6d843
Update cscal.c 1 year ago
  Martin Kroeker ae5bdb76fb
Update cscal.c 1 year ago
  Martin Kroeker 344b14a374
Update cscal.c 1 year ago
  Martin Kroeker 35256671de
Update cscal.c 1 year ago
  Martin Kroeker a1efb03610
Update cscal.c 1 year ago
  Martin Kroeker 80bf765839
Update cscal.c 1 year ago
  Martin Kroeker b1008985ae
Update cscal.c 1 year ago
  Martin Kroeker 41cd46c2a9
Update cscal.c 1 year ago
  Martin Kroeker 7b915870eb
Update cscal.c 1 year ago
  Martin Kroeker ef01810dde
Update cscal.c 1 year ago
  Martin Kroeker 234bba3810
Update cscal.c 1 year ago
  Martin Kroeker 62d8047c42
Update cscal.c 1 year ago
  Martin Kroeker 2a17540469
Update cscal.c 1 year ago
  Martin Kroeker 1c3fcfdbb3
Update cscal.c 1 year ago
  Martin Kroeker 3c150610b7
Update cscal.c 1 year ago
  Martin Kroeker b23efc5846
add handling of dummy2 flag 1 year ago
  Egbert Eich ea6515c4b3 On zarch don't produce objects from assembler with a writable stack section 1 year ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 1 year ago
  Martin Kroeker edbf093c98
Update zarch SCAL kernels to handle INF and NAN arguments (#4829) 1 year ago
  Martin Kroeker 8e872a91a9
Fix erroneous mapping of SUM kernels to ASUM 2 years ago
  Martin Kroeker 20413ee6ec
Update zscal.c 2 years ago
  Martin Kroeker b57627c27f
Handle NAN and INF 2 years ago
  Marius Hillenbrand 22aa81f3e5 s390x: fix cscal and zscal implementations 5 years ago
  Marius Hillenbrand f91057cbad s390x: move common vector definitions and utils into header 5 years ago
  Marius Hillenbrand 2ee5b899ce s390x: enable S/DGEMM block with explicit loop unrolling + interleaving with clang 5 years ago
  Marius Hillenbrand 87e5bbd887 s390x: avoid variable-length arrays in struct for asm operands 5 years ago
  Marius Hillenbrand b9b3265ec8 s390x: avoid inline assembly for vector loads for clang 5 years ago
  Marius Hillenbrand a1616a0b86 s390x: replace nop with "nop 0" in inline assembly 5 years ago
  Marius Hillenbrand 60ef193258 s390x: use "lghi" for immediate values to fix build with clang 5 years ago
  Marius Hillenbrand 07c334e7be s390x: Factor out small block sizes for SGEMM/DGEMM on z14 5 years ago
  Marius Hillenbrand e2828e30aa s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving 5 years ago
  Marius Hillenbrand 89fe17f20e s390x: Use new sgemm kernel also for DGEMM and DTRMM on Z14 6 years ago
  Marius Hillenbrand bdd795ed03 s390x/GEMM: replace 0-init with peeled first iteration 6 years ago
  Marius Hillenbrand 2840432e49 s390x: improvise vector alignment hints for older compilers 6 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 6 years ago
  Marius Hillenbrand 71b6eaf459 s390x: Use new sgemm kernel also for strmm on Z14 and newer 6 years ago
  Marius Hillenbrand 43c0d4f312 s390x: Add vectorized sgemm kernel for Z14 and newer 6 years ago
  int_13h 96ad579428 add in runtime cpu detection for zarch (#2349) 6 years ago
  Andreas Arnez d117dfd505 Change bad usage of "asum" to "sum" in ZARCH versions of ?sum 6 years ago
  Martin Kroeker 246ca29679
Add ZARCH implementation of ?sum 7 years ago
  maamountki 0a54c98b9d
[ZARCH] Modify constraints 7 years ago
  maamountki bec54ae366
[ZARCH] Fix caxpy 7 years ago
  maamountki f583674109
[ZARCH] Fix cgemv_t_4 7 years ago
  maamountki 77fe70019f
[ZARCH] Fix constraints and source code formatting 7 years ago
  maamountki 7039770165
[ZARCH] Undo the last commit 7 years ago
  maamountki 11a43e8116
[ZARCH] Set alignment hint for vl/vst 7 years ago