4796 Commits (aa286e301b3a4883970107eddb8f02744bdf2fc9)
 

Author SHA1 Message Date
  Martin Kroeker aa286e301b
Add typedef for bfloat16 if needed 5 years ago
  Martin Kroeker 9f0ef9cdfc
Merge pull request #77 from xianyi/develop 5 years ago
  Martin Kroeker 6bfc66663c
revert 5 years ago
  Martin Kroeker a8c6fb9e1c
revert 5 years ago
  Martin Kroeker 5ec8f716cf
revert 5 years ago
  Martin Kroeker 82f8a0aeba
Update .drone.yml 5 years ago
  Martin Kroeker d57d503c15
Update Makefile 5 years ago
  Martin Kroeker 37ac23e8a3
Add simple MT sgemm precision test and INTERFACE64 build 5 years ago
  Martin Kroeker 6a93e3b2ba
Add simple sgemm preicsion test 5 years ago
  Martin Kroeker 47ce1dd08f
Update gemm64.cpp 5 years ago
  Martin Kroeker f5fcc5baec
Add trivial gemm test for multithread consistency 5 years ago
  Martin Kroeker efdd237a91
Add a dedicated POWER9 build to the Travis CI (#2774) 5 years ago
  Martin Kroeker 4573cb2f43
Merge pull request #2765 from martin-frbg/issue2760 5 years ago
  Martin Kroeker 2a4bb797db
Merge pull request #2773 from martin-frbg/issue2770 5 years ago
  Martin Kroeker cbbe38bb88
Merge pull request #2772 from mhillenibm/s390x_gemm_tuning 5 years ago
  Martin Kroeker 619343278d
Fix mishandling of NO_CBLAS=0 and NO_LAPACKE=0 5 years ago
  Martin Kroeker fee361ae64
fix another source of NO_CBLAS=0 surprise 5 years ago
  Martin Kroeker 62f4c84f27
Merge pull request #76 from xianyi/develop 5 years ago
  Marius Hillenbrand e115c97e05 s390x/SGEMM: adjust default P and Q to multiples of M 5 years ago
  Marius Hillenbrand 07c334e7be s390x: Factor out small block sizes for SGEMM/DGEMM on z14 5 years ago
  Marius Hillenbrand e2828e30aa s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving 5 years ago
  Martin Kroeker 7219c9cb87
Merge pull request #2764 from martin-frbg/lapacktests 5 years ago
  Martin Kroeker c9d32674ea
Add memory barrier to the blas_lock implementation for Linux 5 years ago
  Martin Kroeker 64259d521a
Fix use of unallocated array in workspace query and wrong type of argument to xSCAL 5 years ago
  Martin Kroeker 6f5ca44c1a
Expand TAU array as SGEMQR/DGEMQR read elements 2 and 3 5 years ago
  Martin Kroeker d28b3f2776
Create Jenkinsfile for OSUOSL PowerCI 5 years ago
  Martin Kroeker ba3f7b3acf
Merge pull request #2761 from RajalakshmiSR/Makefile_err 5 years ago
  Rajalakshmi Srinivasaraghavan 475b5c95b9 Remove extra symbol in Makefile 5 years ago
  Martin Kroeker cd60080d4a
Merge pull request #2758 from martin-frbg/undef_shift 5 years ago
  Martin Kroeker 4847bfdddd
Merge pull request #2757 from martin-frbg/cmake64 5 years ago
  Martin Kroeker 81dcfdcf39
Multiply by 2 instead of left-shifting a potentially negative number 5 years ago
  Martin Kroeker 0ef4b3f1f2
Multiply instead of doing a left shift of a potentially negative number 5 years ago
  Martin Kroeker aa53a8a5cb
Multiply by two instead of left-shifting one place 5 years ago
  Martin Kroeker aa3a1e7d8c
Multiply by two rather than left shift by one place 5 years ago
  Martin Kroeker aaf1a17168
Apply current library name suffix 5 years ago
  Martin Kroeker 53add6a80d
Apply library name suffix to openblas if any 5 years ago
  Martin Kroeker 9eb897cc01
Merge pull request #75 from xianyi/develop 5 years ago
  Martin Kroeker 7cead56258
Merge pull request #2753 from martin-frbg/issue2751 5 years ago
  Martin Kroeker 6794ac3415
Add SYMBOLPREFIX and/or -SUFFIX to cblas.h if needed 5 years ago
  Martin Kroeker ecf4b9e0fc
Improve substitution rules for SYMBOLPREFIX and -SUFFIX addition 5 years ago
  Martin Kroeker dfe5d09641
Merge pull request #2756 from martin-frbg/issue2755 5 years ago
  Martin Kroeker 60cd5e55fc
Protect against inadvertent activation of USE_CUDA 5 years ago
  Martin Kroeker da9e2a7ada
Add SYMBOLPREFIX and/or SYMBOLSUFFIX to cblas prototypes 5 years ago
  Martin Kroeker c88cbc5e0d
Merge pull request #2752 from kadler/cpuid_aix 5 years ago
  Kevin Adler 589c74aed3
Use systemcfg APIs for CPU detection on AIX 5 years ago
  Martin Kroeker 104aa678b0
Fix inadvertent version number reversal to 0.3.9.dev caused by #2710 5 years ago
  Martin Kroeker c6b48e0394
Merge pull request #2749 from martin-frbg/make_ppc 5 years ago
  Martin Kroeker 4927251298
Merge pull request #2750 from RajalakshmiSR/dgemv_p10 5 years ago
  Rajalakshmi Srinivasaraghavan f77b6a83f4 dgemv optimization for POWER10 5 years ago
  Martin Kroeker 39724e8128
Separate OpenMP handling and allow compilation of Power9 code with older gcc 5 years ago