OpenBLAS

Commit Graph

Author	SHA1	Message	Date
oon3m0oo	bdb29242a3	Merge `ba586c3d16` into `4dd70d98d7`	8 years ago
Martin Kroeker	504310eeb9	Merge pull request #1665 from martin-frbg/cpuid-ryzen2 Add cpuid for AMD Ryzen 2	8 years ago
Martin Kroeker	d0ec4325cf	Add cpuid for AMD Ryzen 2	8 years ago
Craig Donner	ba586c3d16	Ensure that the gotoblas lookup table is always initialized. It is possible to build a program that calls a non-GEMM OpenBLAS routine from a static initializer. Since the order of initialization is undefined, and even less defined when using __attribute__((constructor)) in one TU and a C++ static initializer in another TU, it can happen (and does, unfortunately) that gotoblas_init is not called before the first BLAS routine. This results in a segfault when trying to index into the gotoblas table. The solution I have here is indirection: rather than directly using the table use an inlined function to first check if it's been initialized. Since it will only not have been done once, hopefully the branch prediction still keeps things fast.	8 years ago
Martin Kroeker	9d15a3bd16	Fix typo that broke compilation with DYNAMIC_ARCH and NO_AVX2 fixes 1659	8 years ago
Martin Kroeker	750162a05f	Try gradual fallback for cores not in the dynamic core list	8 years ago
Martin Kroeker	1833a67071	Add support for a user-defined list of dynamic targets	8 years ago
Martin Kroeker	63f7395fb4	Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option	8 years ago
Martin Kroeker	38ad05bd04	Extend loop range to find SkylakeX in force_coretype	8 years ago
Martin Kroeker	8be027e4c6	Update dynamic.c	8 years ago
Martin Kroeker	ac7b6e3e9a	Fix misplaced endif	8 years ago
Martin Kroeker	ef626c6824	typo fix	8 years ago
Martin Kroeker	5a51cf4576	Separate Skylake X from Skylake	8 years ago
Arjan van de Ven	99c7bba8e4	Initial support for SkylakeX / AVX512 This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server) target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set, which brings 2 basic things: 1) 512 bit wide SIMD (2x width of AVX2) 2) 32 SIMD registers (2x the number on AVX2) This initial patch only contains a trivial transformation of the Haswell SGEMM kernel to AVX512VL; more will follow later but this patch aims to get the infrastructure in place for this "later". Full performance tuning has not been done yet; with more registers and wider SIMD it's in theory possible to retune the kernels but even without that there's an interesting enough performance increase (30-40% range) with just this change.	8 years ago
Isuru Fernando	2f12ea017b	No strncasecmp with MSVC	8 years ago
Gian-Carlo Pascutto	9c884986ad	Add an extra family/model combination used by AMD Steamroller (Godavari).	9 years ago
Gian-Carlo Pascutto	0cbd2d34e4	Recognize ZEN when passed as OPENBLAS_CORETYPE.	9 years ago
Gian-Carlo Pascutto	62979fd104	Fix dynamic detection for ZEN CPUs.	9 years ago
Denis Steckelmacher	c9ff735da6	Add ZEN support (tested for auto-detected static backend)	9 years ago
Andrew	5088523786	detect apollo lake for real	9 years ago
Elliot Saba	1d8ab99e09	Add `exfamily == 9` case (Kaby Lake) to dynamic arch detection	9 years ago
Martin Koehler	76c6e33e54	Enable EXCAVATOR kernels for A12-9800	9 years ago
Martin Kroeker	596ead0f8d	Add files via upload	9 years ago
Martin Kroeker	8a8f3932eb	Update dynamic.c Add Bay Trail "Pentium N3520" atom	9 years ago
Martin Kroeker	7de829f713	Update dynamic.c Add Braswell (extended model 4, model 12) N3150 as Nehalem	10 years ago
Werner Saar	2b967590a0	bugfix in dynamic.c	10 years ago
Zhang Xianyi	1edf30b790	Change Opteron(SSE3) to Opteron_SSE3 at dynamic core name.	10 years ago
Martin Kroeker	935356c34f	Update dynamic.c and cpuid_x86.c for Intel Avoton. Second part of "support Intel Avoton via Nehalem kernel"	10 years ago
Zhang Xianyi	839395fc25	Detect AMD Trinity and Richland.	10 years ago
Zhang Xianyi	cc7cab8a45	Detect other Intel Skylake cores. http://users.atw.hu/instlatx64/	10 years ago
Yichao Yu	61ae47eb99	Ref #632 . Support Intel Skylake by Haswell kernels.	10 years ago
Zhang Xianyi	51ff17d46e	Add AMD Excavator target.	11 years ago
Zhang Xianyi	8977b3f235	Refs #529 . Support Intel Broadwell by Haswell kernels.	11 years ago
Zhang Xianyi	e95d64333a	Refs #519 . Avoid calling strncpy.	11 years ago
Werner Saar	0dc559ed30	bugfix in dynamic.c	11 years ago
Werner Saar	4319769b79	added target processor STEAMROLLER	11 years ago
Zhang Xianyi	c94762bb56	Refs #401 . Added NO_AVX2 flag for old binutils (e.g. RHEL6)	12 years ago
Timothy Gu	6c2ead30f0	Remove all trailing whitespace except lapack-netlib Signed-off-by: Timothy Gu <timothygu99@gmail.com>	12 years ago
wernsaar	53bfa51ee0	Ref #385 : fixed warnings in dynamic.c	12 years ago
wernsaar	a86d349a51	Ref #380 : enhancements for dynamic_arch	12 years ago
Zhang Xianyi	8c7687b419	Refs #338 . Added OPENBLAS_VERBOSE environment variable at runtime. By default, OpenBLAS doesn't output the warning message. You can set OPENBLAS_VERBOSE (e.g. export OPENBLAS_VERBOSE=1) to enable the warning message at runtime.	12 years ago
Zhang Xianyi	ab69443bd4	Refs #332 . Added additional Intel Ivy Bridge and Haswell CPU-id.	12 years ago
wernsaar	5c648a8984	Merge remote branch 'origin/develop' into haswell	12 years ago
Sébastien Villemot	eae4cfa3f6	Avoid failure on qemu guests declaring an Athlon CPU without 3dnow! The present patch verifies that, on machines declaring an Athlon CPU model and family, the 3dnow and 3dnowext feature flags are indeed present. If they are not, it falls back on the most generic x86 kernel. This prevents crashes due to illegal instruction on qemu guests with a weird configuration. Closes #272	12 years ago
Zhang Xianyi	2638370844	Init code base for Intel Haswell.	13 years ago
Zhang Xianyi	673e453b3f	Enable bulldozer kernels.	13 years ago
Zhang Xianyi	5b504d6c23	Refs #263 . Rollback bulldozer and piledriver kernels to barcelona kernels.	13 years ago
Zhang Xianyi	886cbaf4e4	Support AMD Piledriver by bulldozer kernels.	13 years ago
Dan Luu	88ef307cef	Refs #241 . Add Haswell support (using sandybridge optimizations)	13 years ago
Zhang Xianyi	65ffead0cf	Refs #124 . Check XSAVE flag on x86 CPU.	13 years ago
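
Commit ba586c3d16 above describes replacing direct use of the gotoblas lookup table with an inlined function that first checks whether the table has been initialized. A minimal sketch of that pattern, under stated assumptions: the names `kernel_table_t`, `active_table`, `table_init`, `get_table`, and `checked_dot` are hypothetical and are not taken from the OpenBLAS sources, and the sketch ignores thread safety.

```c
#include <stddef.h>

/* Hypothetical stand-in for a table of kernel function pointers. */
typedef struct {
    double (*dot)(const double *x, const double *y, size_t n);
} kernel_table_t;

/* A generic kernel used to populate the illustrative table. */
static double generic_dot(const double *x, const double *y, size_t n) {
    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
        sum += x[i] * y[i];
    }
    return sum;
}

static kernel_table_t generic_table = { generic_dot };

/* NULL until initialization has run; callers never read this directly. */
static kernel_table_t *active_table = NULL;

/* Counts initializations so the behaviour can be observed. */
static int init_calls = 0;

/* Hypothetical initializer; a real one would pick a table per CPU. */
static void table_init(void) {
    active_table = &generic_table;
    init_calls++;
}

/* The indirection: check before use, so a caller that runs before any
 * constructor-based initialization still sees a valid table.
 * Not thread-safe as written. */
static inline kernel_table_t *get_table(void) {
    if (active_table == NULL) {
        table_init();
    }
    return active_table;
}

/* Example routine that goes through the accessor rather than the table. */
double checked_dot(const double *x, const double *y, size_t n) {
    return get_table()->dot(x, y, n);
}
```

After the first call the check is a single pointer comparison that always takes the same branch, which is the property the commit message relies on for keeping the overhead small.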

64 Commits (bdb29242a38e286680903d49e3e3d3fe05eee310)