Martin Kroeker
22a616bd8f
Add model number for Tiger Lake H (mobile variant)
4 years ago
Marius Hillenbrand
44950ca173
s390x: use DYNAMIC_ARCH's cpu detection for compile-time choice
On s390x, the run-time detection for DYNAMIC_ARCH and the compile-time
choice in cpuid_zarch use different methods for identifying the
supported CPU features. To make cpuid_zarch future-proof and both easier
to maintain, switch cpuid_zarch to the same mechanism as DYNAMIC_ZARCH
(i.e., derive the supported CPU features from hwcap flags) and share
code between both (in a new header cpuid_zarch.h).
Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com>
4 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Rafael Cardoso Fernandes Sousa
0e8b4adf22
Remove unused commented code (#if directive)
4 years ago
Martin Kroeker
fa8bf57768
Merge pull request #3380 from martin-frbg/structwarn
Remove extraneous qualifiers from struct definition
4 years ago
Martin Kroeker
dd09f0173e
Remove extraneous qualifiers from struct definition
4 years ago
Martin Kroeker
2f8220d757
Add sbgemm
4 years ago
Martin Kroeker
5f6a609253
Add sbgemv
4 years ago
Wangyang Guo
045ed5c91d
sbgemm: fix build error in BFLOAT16 disabled
4 years ago
Wangyang Guo
8356a604f0
sbgemm: cooperlake: tuning for block params
4 years ago
Martin Kroeker
cd10d1c03b
Fix typo
4 years ago
Martin Kroeker
2db1a99aca
Clean up debug messages
4 years ago
Martin Kroeker
89fc5b8f4f
Fix unmap logic
4 years ago
Martin Kroeker
7fd12a5e69
Add likely() hints for gcc
4 years ago
Martin Kroeker
2ba9a567aa
Fix typo
4 years ago
Martin Kroeker
b4b952eece
Add auxiliary tracking space for thread buffer frees too
4 years ago
Martin Kroeker
7d1becc575
Allocate an auxiliary struct when running out of preconfigured threads
4 years ago
Martin Kroeker
898212efcd
Actually add the message to the TLS section
4 years ago
Martin Kroeker
210a1584c5
Rebase source and edit TLS version of the message as well
4 years ago
Martin Kroeker
f2a7a67f5a
Improve the "tried to allocate too many buffers" error message
4 years ago
Craig Watson
4d7dfe4845
Include Haiku in processor count checks
4 years ago
JonasZhou
0fca36c8c3
Add cpu detection support for Zhaoxin processors
Signed-off-by: JonasZhou <JonasZhou@zhaoxin.com>
4 years ago
River Dillon
2f6326a630
Remove <linux/unistd.h>
4 years ago
Martin Kroeker
8f22ac552b
Add vendor string Shanghai as successor to Centaur
4 years ago
Martin Kroeker
eb2fdd3af0
Recognize newer Zhaoxin/Centaur processors as Nehalem
4 years ago
User User-User
750719528a
bugz
4 years ago
User User-User
6423b282a1
dynamic_arch
4 years ago
Martin Kroeker
307c4c0786
Fix typo
4 years ago
Martin Kroeker
e83df93975
Work around another recent macro name collision with winnt.h
4 years ago
Martin Kroeker
cbfd3c87e1
Recognize Intel Ice Lake SP as Cooper Lake
4 years ago
Martin Kroeker
623d580b4c
Restore __volatile__ keyword
4 years ago
Martin Kroeker
186368ddc3
Fix compilation with CLANG
4 years ago
Martin Kroeker
1a3ad4b670
Fix signatures of the TLS-mode dll_callback and p_process_term functions for Win64
5 years ago
Peter Hawkins
dbbf92c1d1
Fix race in blas_thread_shutdown.
blas_server_avail was read without holding server_lock. If multiple threads call blas_thread_shutdown simultaneously, for example, by calling fork(), then they can attempt to shut down multiple times. This can lead to a segmentation fault.
5 years ago
Martin Kroeker
cb429d6b12
Merge pull request #3110 from martin-frbg/issue3108
Fix get_num_procs() in the USE_TLS branch for non-glibc systems
5 years ago
Martin Kroeker
b0bded3f2f
Fix get_num_procs() in the USE_TLS branch for non-glibc systems
5 years ago
Martin Kroeker
e4e5042e38
Recognize Intel Tiger Lake as SkylakeX
5 years ago
Martin Kroeker
0cc36770f1
Merge pull request #3073 from xoviat/embedded
add embedded option
5 years ago
Martin Kroeker
eea0c0f2ed
Merge pull request #3085 from alexhenrie/memory_alloc
Fix null pointer check in blas_memory_alloc
5 years ago
Martin Kroeker
0cb9e9fc8d
Remove the VORTEX support bits again for now
5 years ago
Alex Henrie
113840da12
Fix null pointer check in blas_memory_alloc
5 years ago
Martin Kroeker
deb2e66bcc
Add DYNAMIC_LIST support for ARM64
5 years ago
xoviat
2e8d6e8690
add functions for embedded
5 years ago
Martin Kroeker
b94dab5250
patch to support power10 in builtin_cpu_is was backported to gcc 10.2, so allow that as wel
5 years ago
Martin Kroeker
63fa3c3f8f
Require gcc 11 for builtin_cpu_is(power10)
fixes #3074
5 years ago
xoviat
b60de4447a
add cortex-m platform
5 years ago
Martin Kroeker
2c445be8ba
Merge pull request #3051 from martin-frbg/rocketlake
Add CPUID information for Intel Rocket Lake
5 years ago
Martin Kroeker
6fe0f1fab9
Label get_cpu_ftr as volatile to keep gcc from rearranging the code
5 years ago
Martin Kroeker
17c16f2a71
Implement builtin_cpu_is and limit cpu choices to P8 and P9 for NVIDIA compilers
5 years ago
Martin Kroeker
865676682d
Add Intel Rocket Lake
5 years ago