101 Commits (a1eecccda28cf7d00a5ffbbcd5afb4ca6ef6c6a1)

Author SHA1 Message Date
  Martin Kroeker d3ff1f889f Convert ifndefs to ifneq 5 years ago
  Rajalakshmi Srinivasaraghavan b5d30b390d Fix build issues with bfloat16 5 years ago
  Martin Kroeker 006c7f6671 Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 886a8e3190 Adapt for supporting only a subset of variable types 5 years ago
  Martin Kroeker ac653c94f3 Merge branch 'develop' into issue2588-cmake 5 years ago
  Martin Kroeker 988a6f429e Add BUILD_vartype defines 5 years ago
  Martin Kroeker e5e2fbd593 Support building only selected types 5 years ago
  y00512012 06cf73a239 fix a bug of trmm 5 years ago
  Martin Kroeker ddec244a5a Merge pull request #2838 from austinpagan/gordon_trmm 5 years ago
  fossum dfeca46098 Adding performance patch for trmm, just like #2836 5 years ago
  fossum 274d6e015b Fixing a performance bug in trsm_[LR].c. 5 years ago
  Martin Kroeker 330044d821 Fix potentiol domain error in sqrt 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Martin Kroeker ce45af8151 Update conditional for atomics to use HAVE_C11 5 years ago
  Martin Kroeker 6f38de06d2 Update conditional for atomics to use HAVE_C11 5 years ago
  Martin Kroeker 5dd14e3d48 Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Ali Saidi 97ce6bbce2 Fix barriers in level3_thread 6 years ago
  wjc404 2f96a2c55b Update trmm_R.c 6 years ago
  wjc404 833bd0f8ff Update trmm_L.c 6 years ago
  wjc404 77b8f49556 Update level3_thread.c 6 years ago
  wjc404 1c3e20ce48 Update level3.c 6 years ago
  wjc404 e9fb8f62b1 Update level3_gemm3m_thread.c 6 years ago
  wjc404 4c35b8dbaa Update gemm3m_level3.c 6 years ago
  Martin Kroeker f3065a0eed Fix race conditions in multithreaded GEMM3M 6 years ago
  Martin Kroeker f343ed65b5 Avoid taking the root of a negative number 7 years ago
  Martin Kroeker f72fdf525c Merge pull request #1875 from martin-frbg/issue1851 7 years ago
  Martin Kroeker 113cb00b95 fix missing parenthesis 7 years ago
  Martin Kroeker 5192651706 Add CriticalSection handling instead of mutexes for Windows 7 years ago
  Martin Kroeker 2e6fae2aad Serialize accesses to parallelized level3 functions from multiple callers 7 years ago
  Arjan van de Ven 5b708e5eb1 sgemm/dgemm: add a way for an arch kernel to specify prefered sizes 7 years ago
  Martin Kroeker 5f2a3c05cd Revert "Rewrite &= -> = and simplify the initial blocking phase." 7 years ago
  Craig Donner 0144068537 Rewrite &= -> = and simplify the initial blocking phase. 7 years ago
  Arjan van de Ven 73de17664d Add missing barriers in gemm scheduler 7 years ago
  Arjan van de Ven d148ec4ea1 Don't use _Atomic for jobs sometimes... 7 years ago
  Arjan van de Ven 9e162146a9 Only initialize the part of the jobs array that will get used 7 years ago
  Martin Kroeker a91f1587b9 Work around name clash with Windows10's winnt.h 7 years ago
  Zhiyong Dang 3716267124 Change _STDC_VERSION__ to __STDC_VERSION__ 7 years ago
  Martin Kroeker 6a99fcce94 Use _Atomic instead of volatile for thread safety where C11 is supported 8 years ago
  Andrew 11a627c54e remove surplus parentheses to silence clang5 8 years ago
  Andrew bfc2a88594 remove unused buffer 8 years ago
  Andrew ef95cd471f elminate unread variable, after reiteration 3 of them (clang4) 8 years ago
  Martin Kroeker db72ad8f6a Merge pull request #1320 from timmoon10/develop 8 years ago
  Martin Kroeker 514d237257 Merge pull request #1279 from xsacha/develop 8 years ago
  Tim Moon 30486a356c Reduce number of data partitions in n. 8 years ago
  Tim Moon 9de52b489a Cleaning up and documenting multi-threaded GEMM code. 8 years ago
  Tim Moon 860dcfc703 Use 2D thread distribution for small GEMMs. 8 years ago
  Tim Moon 6aaa107865 Reducing threads for multi-threaded GEMMs on small matrices. 8 years ago
  Sacha Refshauge 37858d1146 Fix threading usage in CMake: s/SMP/USE_THREAD/ 8 years ago
  Isuru Fernando d245caa49a Support out-of-source build 8 years ago