4797 Commits (a7fc14c501d2258d43bf0fb1f0d2d949e7ef7cec)
 

Author SHA1 Message Date
  Martin Kroeker a7fc14c501
Limit direct sgemm to x86_64 5 years ago
  Martin Kroeker b86214e434
Limit direct sgemm to x86_64 5 years ago
  Martin Kroeker 1ba18212da
Update common_s.h 5 years ago
  Martin Kroeker 5a0e9e8ded
Update setparam-ref.c 5 years ago
  Martin Kroeker e46d761bca
Update setparam-ref.c 5 years ago
  Martin Kroeker 6c279ef552
Update setparam-ref.c 5 years ago
  Martin Kroeker 7996458ea1
Update common_s.h 5 years ago
  Martin Kroeker 7fe38daee5
use macros for sgemm_direct to support dynamic_arch naming via common_s,h 5 years ago
  Martin Kroeker af80849063
Add sgemm_direct 5 years ago
  Martin Kroeker 54e02aaf11
Update gemm.c 5 years ago
  Martin Kroeker a83cb3966d
Refactor sgemm_direct 5 years ago
  Martin Kroeker 5a74bd45fd
remove include as sgemm_direct is handled at the makefile level now 5 years ago
  Martin Kroeker 56d4d4f84b
Move sgemm_direct_performant helper to separate file 5 years ago
  Martin Kroeker 2586b26e29
Add direct_sgemm 5 years ago
  Martin Kroeker 86e3455d02
Add sgemm_direct targets 5 years ago
  Martin Kroeker 774029af38
move sgemm_direct function declarations 5 years ago
  Martin Kroeker 82f8a0aeba
Update .drone.yml 5 years ago
  Martin Kroeker d57d503c15
Update Makefile 5 years ago
  Martin Kroeker 37ac23e8a3
Add simple MT sgemm precision test and INTERFACE64 build 5 years ago
  Martin Kroeker 6a93e3b2ba
Add simple sgemm preicsion test 5 years ago
  Martin Kroeker 47ce1dd08f
Update gemm64.cpp 5 years ago
  Martin Kroeker f5fcc5baec
Add trivial gemm test for multithread consistency 5 years ago
  Martin Kroeker 62f4c84f27
Merge pull request #76 from xianyi/develop 5 years ago
  Martin Kroeker 7219c9cb87
Merge pull request #2764 from martin-frbg/lapacktests 5 years ago
  Martin Kroeker 64259d521a
Fix use of unallocated array in workspace query and wrong type of argument to xSCAL 5 years ago
  Martin Kroeker 6f5ca44c1a
Expand TAU array as SGEMQR/DGEMQR read elements 2 and 3 5 years ago
  Martin Kroeker d28b3f2776
Create Jenkinsfile for OSUOSL PowerCI 5 years ago
  Martin Kroeker ba3f7b3acf
Merge pull request #2761 from RajalakshmiSR/Makefile_err 5 years ago
  Rajalakshmi Srinivasaraghavan 475b5c95b9 Remove extra symbol in Makefile 5 years ago
  Martin Kroeker cd60080d4a
Merge pull request #2758 from martin-frbg/undef_shift 5 years ago
  Martin Kroeker 4847bfdddd
Merge pull request #2757 from martin-frbg/cmake64 5 years ago
  Martin Kroeker 81dcfdcf39
Multiply by 2 instead of left-shifting a potentially negative number 5 years ago
  Martin Kroeker 0ef4b3f1f2
Multiply instead of doing a left shift of a potentially negative number 5 years ago
  Martin Kroeker aa53a8a5cb
Multiply by two instead of left-shifting one place 5 years ago
  Martin Kroeker aa3a1e7d8c
Multiply by two rather than left shift by one place 5 years ago
  Martin Kroeker aaf1a17168
Apply current library name suffix 5 years ago
  Martin Kroeker 53add6a80d
Apply library name suffix to openblas if any 5 years ago
  Martin Kroeker 9eb897cc01
Merge pull request #75 from xianyi/develop 5 years ago
  Martin Kroeker 7cead56258
Merge pull request #2753 from martin-frbg/issue2751 5 years ago
  Martin Kroeker 6794ac3415
Add SYMBOLPREFIX and/or -SUFFIX to cblas.h if needed 5 years ago
  Martin Kroeker ecf4b9e0fc
Improve substitution rules for SYMBOLPREFIX and -SUFFIX addition 5 years ago
  Martin Kroeker dfe5d09641
Merge pull request #2756 from martin-frbg/issue2755 5 years ago
  Martin Kroeker 60cd5e55fc
Protect against inadvertent activation of USE_CUDA 5 years ago
  Martin Kroeker da9e2a7ada
Add SYMBOLPREFIX and/or SYMBOLSUFFIX to cblas prototypes 5 years ago
  Martin Kroeker c88cbc5e0d
Merge pull request #2752 from kadler/cpuid_aix 5 years ago
  Kevin Adler 589c74aed3
Use systemcfg APIs for CPU detection on AIX 5 years ago
  Martin Kroeker 104aa678b0
Fix inadvertent version number reversal to 0.3.9.dev caused by #2710 5 years ago
  Martin Kroeker c6b48e0394
Merge pull request #2749 from martin-frbg/make_ppc 5 years ago
  Martin Kroeker 4927251298
Merge pull request #2750 from RajalakshmiSR/dgemv_p10 5 years ago
  Rajalakshmi Srinivasaraghavan f77b6a83f4 dgemv optimization for POWER10 5 years ago