455 Commits (b329e45288c2e7fc0ef15c4e8a7b3c8dfd74a930)

Author SHA1 Message Date
  Martin Kroeker b329e45288 Guard against omp_get_num_places returning zero 4 years ago
  Martin Kroeker 6ed52576f8 Add feature-based fallback for unknown x86_64 cpus 4 years ago
  Martin Kroeker 7a7fbb11c3 define "unlikely" on non-cygwin too 4 years ago
  Martin Kroeker b31349c22a Open up delayed (re)init to non-Cygwin OS as well 4 years ago
  Martin Kroeker c8d05aa7a5 Move the threads overflow flag under the protection of the local blas lock (#3476) 4 years ago
  Rafael Cardoso Fernandes Sousa 214fbcee15 Fix cmake for power 4 years ago
  Martin Kroeker 4f057bffd6 Fix NULL pointer checks in blas_memory_alloc 4 years ago
  Martin Kroeker 08f8bb66c0 Add CPUIDs for Alder Lake and other recent Intel cpus 4 years ago
  Martin Kroeker efb16fafb0 Fix miscounting of threadpool size on Linux with OMP_PROC_BIND=TRUE (#3437) 4 years ago
  Marius Hillenbrand 77747bc536 cpuid_zarch/hwcaps: add documentation and dump hwcaps in init 4 years ago
  Martin Kroeker 22a616bd8f Add model number for Tiger Lake H (mobile variant) 4 years ago
  Marius Hillenbrand 44950ca173 s390x: use DYNAMIC_ARCH's cpu detection for compile-time choice 4 years ago
  Wangyang Guo 3dc6052c7e initial support for Sapphire Rapids platform 4 years ago
  Rafael Cardoso Fernandes Sousa 0e8b4adf22 Remove unused commented code (#if directive) 4 years ago
  Martin Kroeker dd09f0173e Remove extraneous qualifiers from struct definition 4 years ago
  Wangyang Guo 045ed5c91d sbgemm: fix build error in BFLOAT16 disabled 4 years ago
  Wangyang Guo 8356a604f0 sbgemm: cooperlake: tuning for block params 4 years ago
  Martin Kroeker cd10d1c03b Fix typo 4 years ago
  Martin Kroeker 2db1a99aca Clean up debug messages 4 years ago
  Martin Kroeker 89fc5b8f4f Fix unmap logic 4 years ago
  Martin Kroeker 7fd12a5e69 Add likely() hints for gcc 4 years ago
  Martin Kroeker 2ba9a567aa Fix typo 4 years ago
  Martin Kroeker b4b952eece Add auxiliary tracking space for thread buffer frees too 4 years ago
  Martin Kroeker 7d1becc575 Allocate an auxiliary struct when running out of preconfigured threads 4 years ago
  Martin Kroeker 898212efcd Actually add the message to the TLS section 4 years ago
  Martin Kroeker 210a1584c5 Rebase source and edit TLS version of the message as well 4 years ago
  Martin Kroeker f2a7a67f5a Improve the "tried to allocate too many buffers" error message 4 years ago
  Craig Watson 4d7dfe4845 Include Haiku in processor count checks 4 years ago
  JonasZhou 0fca36c8c3 Add cpu detection support for Zhaoxin processors 4 years ago
  River Dillon 2f6326a630 Remove <linux/unistd.h> 4 years ago
  Martin Kroeker 8f22ac552b Add vendor string Shanghai as successor to Centaur 4 years ago
  Martin Kroeker eb2fdd3af0 Recognize newer Zhaoxin/Centaur processors as Nehalem 4 years ago
  User User-User 750719528a bugz 4 years ago
  User User-User 6423b282a1 dynamic_arch 4 years ago
  Martin Kroeker cbfd3c87e1 Recognize Intel Ice Lake SP as Cooper Lake 4 years ago
  Martin Kroeker 623d580b4c Restore __volatile__ keyword 4 years ago
  Martin Kroeker 186368ddc3 Fix compilation with CLANG 4 years ago
  Martin Kroeker 1a3ad4b670 Fix signatures of the TLS-mode dll_callback and p_process_term functions for Win64 5 years ago
  Peter Hawkins dbbf92c1d1 Fix race in blas_thread_shutdown. 5 years ago
  Martin Kroeker cb429d6b12 Merge pull request #3110 from martin-frbg/issue3108 5 years ago
  Martin Kroeker b0bded3f2f Fix get_num_procs() in the USE_TLS branch for non-glibc systems 5 years ago
  Martin Kroeker e4e5042e38 Recognize Intel Tiger Lake as SkylakeX 5 years ago
  Martin Kroeker 0cc36770f1 Merge pull request #3073 from xoviat/embedded 5 years ago
  Martin Kroeker eea0c0f2ed Merge pull request #3085 from alexhenrie/memory_alloc 5 years ago
  Martin Kroeker 0cb9e9fc8d Remove the VORTEX support bits again for now 5 years ago
  Alex Henrie 113840da12 Fix null pointer check in blas_memory_alloc 5 years ago
  Martin Kroeker deb2e66bcc Add DYNAMIC_LIST support for ARM64 5 years ago
  xoviat 2e8d6e8690 add functions for embedded 5 years ago
  Martin Kroeker b94dab5250 patch to support power10 in builtin_cpu_is was backported to gcc 10.2, so allow that as wel 5 years ago
  Martin Kroeker 63fa3c3f8f Require gcc 11 for builtin_cpu_is(power10) 5 years ago