5122 Commits (d7ba7679b6a45dafdbdc4d4709ab49e2c0ec54cf)
 

Author SHA1 Message Date
  Martin Kroeker b8ebfc9335
Update system.cmake 5 years ago
  Martin Kroeker 7c1986640b
fallback from cooperlake to skylake if gcc<10 5 years ago
  Martin Kroeker 71d33c952d
Typo fix 5 years ago
  Martin Kroeker 6a3c074786
-march=cooperlake requires gcc10 5 years ago
  Martin Kroeker 430f741b30
-march=cooperlake requires gcc10 5 years ago
  Martin Kroeker 6f4dc7445d
Fix typo 5 years ago
  Martin Kroeker 81fbe8d088
-march=cooperlake only available in gcc >= 10 5 years ago
  Martin Kroeker bb9cf766f5
make march=cooperlake option conditional on gcc >= 10.1 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Martin Kroeker 2c72972570
Merge pull request #2785 from albertziegenhagel/always-generate-pkg-config 5 years ago
  Albert Ziegenhagel 6b731d917f Do not require pkg-config to generate the *.pc file 5 years ago
  Martin Kroeker 5dcf47cd97
Merge pull request #2784 from martin-frbg/issue2783 5 years ago
  Martin Kroeker aa286e301b
Add typedef for bfloat16 if needed 5 years ago
  Martin Kroeker 9f0ef9cdfc
Merge pull request #77 from xianyi/develop 5 years ago
  Martin Kroeker 6bfc66663c
revert 5 years ago
  Martin Kroeker a8c6fb9e1c
revert 5 years ago
  Martin Kroeker 5ec8f716cf
revert 5 years ago
  Martin Kroeker 82f8a0aeba
Update .drone.yml 5 years ago
  Martin Kroeker d57d503c15
Update Makefile 5 years ago
  Martin Kroeker 37ac23e8a3
Add simple MT sgemm precision test and INTERFACE64 build 5 years ago
  Martin Kroeker 6a93e3b2ba
Add simple sgemm preicsion test 5 years ago
  Martin Kroeker 47ce1dd08f
Update gemm64.cpp 5 years ago
  Martin Kroeker f5fcc5baec
Add trivial gemm test for multithread consistency 5 years ago
  Martin Kroeker 597010a968
Fix incorrect argument to SLASET 5 years ago
  Martin Kroeker d64f1ef26b
Fix incorrect argument to SLASET 5 years ago
  Martin Kroeker c62aad62e5
Fix incorrect calls to DLASET 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Martin Kroeker efdd237a91
Add a dedicated POWER9 build to the Travis CI (#2774) 5 years ago
  Martin Kroeker 4573cb2f43
Merge pull request #2765 from martin-frbg/issue2760 5 years ago
  Martin Kroeker 2a4bb797db
Merge pull request #2773 from martin-frbg/issue2770 5 years ago
  Martin Kroeker cbbe38bb88
Merge pull request #2772 from mhillenibm/s390x_gemm_tuning 5 years ago
  Martin Kroeker 619343278d
Fix mishandling of NO_CBLAS=0 and NO_LAPACKE=0 5 years ago
  Martin Kroeker fee361ae64
fix another source of NO_CBLAS=0 surprise 5 years ago
  Martin Kroeker 62f4c84f27
Merge pull request #76 from xianyi/develop 5 years ago
  Marius Hillenbrand e115c97e05 s390x/SGEMM: adjust default P and Q to multiples of M 5 years ago
  Marius Hillenbrand 07c334e7be s390x: Factor out small block sizes for SGEMM/DGEMM on z14 5 years ago
  Marius Hillenbrand e2828e30aa s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving 5 years ago
  Martin Kroeker 7219c9cb87
Merge pull request #2764 from martin-frbg/lapacktests 5 years ago
  Martin Kroeker c9d32674ea
Add memory barrier to the blas_lock implementation for Linux 5 years ago
  Martin Kroeker 64259d521a
Fix use of unallocated array in workspace query and wrong type of argument to xSCAL 5 years ago
  Martin Kroeker 6f5ca44c1a
Expand TAU array as SGEMQR/DGEMQR read elements 2 and 3 5 years ago
  Martin Kroeker d28b3f2776
Create Jenkinsfile for OSUOSL PowerCI 5 years ago
  Martin Kroeker ba3f7b3acf
Merge pull request #2761 from RajalakshmiSR/Makefile_err 5 years ago
  Rajalakshmi Srinivasaraghavan 475b5c95b9 Remove extra symbol in Makefile 5 years ago
  Martin Kroeker cd60080d4a
Merge pull request #2758 from martin-frbg/undef_shift 5 years ago
  Martin Kroeker 4847bfdddd
Merge pull request #2757 from martin-frbg/cmake64 5 years ago
  Martin Kroeker 81dcfdcf39
Multiply by 2 instead of left-shifting a potentially negative number 5 years ago
  Martin Kroeker 0ef4b3f1f2
Multiply instead of doing a left shift of a potentially negative number 5 years ago
  Martin Kroeker aa53a8a5cb
Multiply by two instead of left-shifting one place 5 years ago
  Martin Kroeker aa3a1e7d8c
Multiply by two rather than left shift by one place 5 years ago