5416 Commits (b0beb0b1ca6469286dd69cdbeeb2c79d96ac66d0)
 

Author SHA1 Message Date
  Chen, Guobing b0beb0b1ca Initial code for Cooperlake BF16 GEMM kernel 5 years ago
  Martin Kroeker 5d6209e1f9
Merge pull request #3055 from RajalakshmiSR/swapp10 5 years ago
  Rajalakshmi Srinivasaraghavan 601b711c78 Optimize swap function for POWER10 5 years ago
  Martin Kroeker 78702753f2
Merge pull request #3053 from pkubaj/patch-1 5 years ago
  pkubaj 7aa1ff8ff6
Fix build on FreeBSD/powerpc64le 5 years ago
  Martin Kroeker d6c97cf010
Merge pull request #3052 from ashwinyes/arm64_fix_nrm2 5 years ago
  Ashwin Sekhar T K 1b2508362b arm64: Fix nrm2 for input vectors with Inf 5 years ago
  Martin Kroeker cd898af59f
Merge pull request #3050 from aurel32/riscv64-openblas-supported 5 years ago
  Aurelien Jarno 0a535e58d8 getarch.c: define OPENBLAS_SUPPORTED for riscv64 5 years ago
  Martin Kroeker 9ce9e295fe
Merge pull request #3049 from martin-frbg/readme 5 years ago
  Martin Kroeker 9a38592c79
Add pointers to the netlib documentation and Gilbert Strang's linear algebra primers 5 years ago
  Martin Kroeker 9b3965b08c
Merge pull request #6 from xianyi/develop 5 years ago
  Martin Kroeker 531cb4f673
Merge pull request #3035 from Joshua-Ashton/patch-1 5 years ago
  Martin Kroeker 3559c5d7a2
Merge pull request #3048 from martin-frbg/issue2998 5 years ago
  Martin Kroeker 8631e2976a
Temporarily revert to the old nrm2 kernels 5 years ago
  Martin Kroeker 2768bc1764
Temporarily revert to the old nrm2 kernels 5 years ago
  Martin Kroeker 6f4698ee1f
Temporarily revert to the old nrm2 kernel 5 years ago
  Martin Kroeker 85e5165e98
Merge pull request #3046 from martin-frbg/nvidiasdk-ppc 5 years ago
  Martin Kroeker 17c16f2a71
Implement builtin_cpu_is and limit cpu choices to P8 and P9 for NVIDIA compilers 5 years ago
  Martin Kroeker 91c3f86c2b
NVIDIA compiler does not yet support POWER10 5 years ago
  Martin Kroeker 75b1f3becc
Limit POWERPC DYNAMIC_CORE list to P8 and P9 for NVIDIA compilers 5 years ago
  Martin Kroeker 07c5e549b2
Merge pull request #3045 from martin-frbg/nvidiasdk 5 years ago
  Martin Kroeker 114eb159a4
Disable FMA intrinsics in the srot kernel when the compiler is PGI/NVIDIA 5 years ago
  Martin Kroeker 005cce5507
Amend SkylakeX options to support the NVIDIA compiler 5 years ago
  Martin Kroeker b859b6e79d
Add nvfortran 5 years ago
  Martin Kroeker b212a2fb9f
Add/modify "PGI" compiler options for NVIDIA SDK 20.11 5 years ago
  Martin Kroeker e40416567a
Add version printout for PGI/NVIDIA compiler 5 years ago
  Martin Kroeker b37e5fa2f8
Merge pull request #5 from xianyi/develop 5 years ago
  Martin Kroeker 326469ef4a
Merge pull request #3042 from martin-frbg/develop 5 years ago
  Martin Kroeker c73d8ee40d
Conditionally add -mfma to compiler options where needed 5 years ago
  Martin Kroeker abef2ea770
Move -fma option setting to kernel/Makefile.L1 5 years ago
  Martin Kroeker b26e32c3af
Merge pull request #3040 from martin-frbg/fixfcheck 5 years ago
  Martin Kroeker 7822eff936
Merge pull request #3038 from martin-frbg/issue3037 5 years ago
  Martin Kroeker b03dc011be
Fix undefined CC variable in clang check 5 years ago
  Martin Kroeker 00ce35336e
Fix spurious removal of a trailing character from the hostarch string on x86_64 5 years ago
  Martin Kroeker 723776ddf7
Merge pull request #4 from xianyi/develop 5 years ago
  Martin Kroeker 5a77ec7f1c
Merge pull request #3036 from RajalakshmiSR/p10copyalign 5 years ago
  Rajalakshmi Srinivasaraghavan 2fb11f873b POWER10: Improve copy performance 5 years ago
  Joshie ad63647446
Define BLAS acronym in README 5 years ago
  Martin Kroeker 87315e8a8d
Update version to 0.3.13.dev 5 years ago
  Martin Kroeker 9031ebd7d5
Update version to 0.3.13.dev 5 years ago
  Martin Kroeker 12b41d5598
Merge pull request #3034 from xianyi/release-0.3.0 5 years ago
  Martin Kroeker d2b11c4777
Merge pull request #3033 from xianyi/develop 5 years ago
  Martin Kroeker 7bc0e4a2e0
Update version to 0.3.13 for release 5 years ago
  Martin Kroeker d3ec787f77
Update version to 0.3.13 for release 5 years ago
  Martin Kroeker 2c309c235d
Merge pull request #3031 from martin-frbg/changelog13 5 years ago
  Martin Kroeker 3dec81200c
Update Changelog.txt 5 years ago
  Martin Kroeker 737724607f
Merge pull request #3030 from martin-frbg/fix2994 5 years ago
  Martin Kroeker 77edf82c7f
Update Changelog.txt for 0.3.13 5 years ago
  Martin Kroeker 6232237dba
Make fallback from P10 to P9 conditional on suitable compiler 5 years ago