401 Commits (018dec858852fb6859c4c1909c8b79374eae2a4f)

Author SHA1 Message Date
  Martin Kroeker 17c16f2a71
Implement builtin_cpu_is and limit cpu choices to P8 and P9 for NVIDIA compilers 5 years ago
  Martin Kroeker 6232237dba
Make fallback from P10 to P9 conditional on suitable compiler 5 years ago
  Martin Kroeker 18d8a67485
Merge pull request #2994 from antonblanchard/power10-fixes 5 years ago
  gxw 4b548857d6 Add msa support for loongson 5 years ago
  Martin Kroeker bc5b1ddf0d
Merge pull request #3004 from martin-frbg/bsd_getauxval 5 years ago
  Martin Kroeker e7bf8ced6c
Build fix for systems that do not support getauxval 5 years ago
  Martin Kroeker 5fa305172a
Use ifeq instead of ifdef for user-definable options 5 years ago
  Alexander Grund 60005eb47b
Don't overwrite blas_thread_buffer if already set 5 years ago
  Anton Blanchard 043f3d6faa POWER10: Use POWER9 as a fallback 5 years ago
  Martin Kroeker ff16329cb7
Merge pull request #2972 from xiegengxin/rot-intrinsic 5 years ago
  Gengxin Xie d9ba49165a Improve the performance of rot by using AVX512 and AVX2 intrinsic 5 years ago
  Martin Kroeker aa21cb5217
Merge pull request #2960 from thrasibule/avx2_detection 5 years ago
  Guillaume Horel 1f564d729b fix avx2 detection 5 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker 2207a16235
Merge pull request #2952 from martin-frbg/issue2931 5 years ago
  Martin Kroeker b937d78a6d
Try to read cpu information from /sys/devices/system/cpu/cpu0 if HWCAP_CPUID fails 5 years ago
  Martin Kroeker fd7da56965
Move definitions that are neither needed nor supported on SUNOS 5 years ago
  Martin Kroeker ff65952e46
Move HAVE_P10_SUPPORT to the build system 5 years ago
  Martin Kroeker 85154c2e18
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker ac653c94f3
Merge branch 'develop' into issue2588-cmake 5 years ago
  Martin Kroeker f032d8966e
Merge pull request #2874 from Flamefire/memory_fixes 5 years ago
  Martin Kroeker f6e4cf2f9d
Merge pull request #2876 from Flamefire/omp_fork_fix 5 years ago
  User User-User d2333e7842 aarch64 fix std=c18 compilation 5 years ago
  Alexander Grund 3094fc6c83
Lazyly reinit threads after a fork in OMP mode 5 years ago
  Alexander Grund 3c05f54df8
Avoid out of bounds access on invalid memory free 5 years ago
  Alexander Grund dee7c49938
Fix TABs and trailing space 5 years ago
  Martin Kroeker 896bbd55e1
Add support for building only selected variable types 5 years ago
  Martin Kroeker 357bff06b5
Add BUILD_vartype defines 5 years ago
  Martin Kroeker 91c84e1c01
Merge pull request #2796 from Guobing-Chen/BF16_dot_coversion_apis 5 years ago
  Marius Hillenbrand a55fe06f25 s390x/DYNAMIC_ARCH: define a HW_CAP flag to support slightly older glibc versions 5 years ago
  Marius Hillenbrand 4f34bcfb5e s390x/DYNAMIC_ARCH: pass supported arch levels from Makefile to run-time code 5 years ago
  Chen, Guobing deaeb6c5b8 Add bfloat16 based dot and conversion with single/double 5 years ago
  Chen, Guobing 0c1c903f1e Fix OMP num specify issue 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Martin Kroeker 60cd5e55fc
Protect against inadvertent activation of USE_CUDA 5 years ago
  Martin Kroeker 7c02f4b1f7
Merge pull request #2744 from martin-frbg/issue2738 5 years ago
  Martin Kroeker 12918358aa
Add AMD Renoir/Matisse and preliminary support for Zen3 as Zen2 5 years ago
  Ashwin Sekhar T K 4e1be0e481 ARM64: Add THUNDERX3T110 Target 5 years ago
  Martin Kroeker 09eb9d2584
Update conditional for atomics to HAVE_C11 5 years ago
  Martin Kroeker 791e046744
Update conditional for atomics to use HAVE_C11 5 years ago
  Martin Kroeker 94bab9d1f9
Update conditional for atomics to use HAVE_C11 5 years ago
  Rajalakshmi Srinivasaraghavan af1e140e35 Change minimum gcc version for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 45d819ca82 Changing mcpu option as power10 5 years ago
  Martin Kroeker 584ef8d4ae
Add support for Comet Lake H & S 5 years ago
  Matthew Treinish f37e941d52
Add support to driver/others/dynamic.c too 5 years ago
  User User-User e6b9275034 address vs2019 C4293 5 years ago
  Martin Kroeker 6eaeb01263
Merge pull request #2658 from RajalakshmiSR/p10 5 years ago
  Martin Kroeker 007d9f97d7
Make gotoblas_corename report the name of the selected TARGET rather than its aliases 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Marius Hillenbrand 0dbe61a612 s390x: choose SIMD kernels at run-time based on OS and compiler support 5 years ago