95 Commits (0dea52ecbf982b0ad2c5ae5dc2dea114dddb388a)

Author SHA1 Message Date
  Martin Kroeker 0dea52ecbf
Update zscal.c 1 year ago
  Martin Kroeker 18272263ad
Update zscal.c 1 year ago
  Martin Kroeker 9ba40a0c12
Update zscal.c 1 year ago
  Martin Kroeker 4a226732d8
Handle INF and NAN 1 year ago
  Martin Kroeker 37e3c5ff9d
Update zscal.c 1 year ago
  Martin Kroeker 7caa35666d
Update zscal.c 1 year ago
  Martin Kroeker 3aab164f02
fix index variable 1 year ago
  Martin Kroeker bb992395f0
cannot use isnan here as need to check for INF as well 1 year ago
  Martin Kroeker a842008ab4
Update zscal.c 1 year ago
  Martin Kroeker 892b65c22d
Update cscal.c 1 year ago
  Martin Kroeker e6a6296d13
Update cscal.c 1 year ago
  Martin Kroeker 2c36a48130
Update zscal.c 1 year ago
  Martin Kroeker bde5b902aa
Update dscal.c 1 year ago
  Martin Kroeker 91f656cd76
handle INF and NAN 1 year ago
  Martin Kroeker 6843ec7732
handle INF and NAN in input 1 year ago
  Martin Kroeker ee09657cf1
make NAN handling depend on DUMMY2 argument 1 year ago
  Martin Kroeker 8e872a91a9
Fix erroneous mapping of SUM kernels to ASUM 2 years ago
  Martin Kroeker 20413ee6ec
Update zscal.c 2 years ago
  Martin Kroeker b57627c27f
Handle NAN and INF 2 years ago
  Marius Hillenbrand 22aa81f3e5 s390x: fix cscal and zscal implementations 5 years ago
  Marius Hillenbrand f91057cbad s390x: move common vector definitions and utils into header 5 years ago
  Marius Hillenbrand 2ee5b899ce s390x: enable S/DGEMM block with explicit loop unrolling + interleaving with clang 5 years ago
  Marius Hillenbrand 87e5bbd887 s390x: avoid variable-length arrays in struct for asm operands 5 years ago
  Marius Hillenbrand b9b3265ec8 s390x: avoid inline assembly for vector loads for clang 5 years ago
  Marius Hillenbrand a1616a0b86 s390x: replace nop with "nop 0" in inline assembly 5 years ago
  Marius Hillenbrand 60ef193258 s390x: use "lghi" for immediate values to fix build with clang 5 years ago
  Marius Hillenbrand 07c334e7be s390x: Factor out small block sizes for SGEMM/DGEMM on z14 5 years ago
  Marius Hillenbrand e2828e30aa s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving 5 years ago
  Marius Hillenbrand 89fe17f20e s390x: Use new sgemm kernel also for DGEMM and DTRMM on Z14 6 years ago
  Marius Hillenbrand bdd795ed03 s390x/GEMM: replace 0-init with peeled first iteration 6 years ago
  Marius Hillenbrand 2840432e49 s390x: improvise vector alignment hints for older compilers 6 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 6 years ago
  Marius Hillenbrand 71b6eaf459 s390x: Use new sgemm kernel also for strmm on Z14 and newer 6 years ago
  Marius Hillenbrand 43c0d4f312 s390x: Add vectorized sgemm kernel for Z14 and newer 6 years ago
  int_13h 96ad579428 add in runtime cpu detection for zarch (#2349) 6 years ago
  Andreas Arnez d117dfd505 Change bad usage of "asum" to "sum" in ZARCH versions of ?sum 6 years ago
  Martin Kroeker 246ca29679
Add ZARCH implementation of ?sum 7 years ago
  maamountki 0a54c98b9d
[ZARCH] Modify constraints 7 years ago
  maamountki bec54ae366
[ZARCH] Fix caxpy 7 years ago
  maamountki f583674109
[ZARCH] Fix cgemv_t_4 7 years ago
  maamountki 77fe70019f
[ZARCH] Fix constraints and source code formatting 7 years ago
  maamountki 7039770165
[ZARCH] Undo the last commit 7 years ago
  maamountki 11a43e8116
[ZARCH] Set alignment hint for vl/vst 7 years ago
  maamountki 61526480f9
[ZARCH] Fix copy constraint 7 years ago
  maamountki 81daf6bc38
[ZARCH] Format source code, Fix constraints 7 years ago
  Martin Kroeker 874df65491
Fix incorrect sgemv results for IBM z14 7 years ago
  Martin Kroeker 877023e1e1
Fix precision of zarch DSDOT 7 years ago
  Martin Kroeker 265142edd5
Fix typo in the zarch min/max kernels 7 years ago
  maamountki 29416cb5a3
[ZARCH] Add Z13 version for max/min functions 7 years ago
  maamountki 48b9b94f7f
[ZARCH] Improve loading performance for camax/icamax 7 years ago