Daniel Cohen Gindi
63bbd7b0d7
Better support for MSVC/Windows in CMake
7 years ago
Martin Kroeker
010d59bfee
Merge pull request #1973 from martin-frbg/issue1464
Increase Zen SWITCH_RATIO to 16
7 years ago
Martin Kroeker
83b5c6b92d
Fix compilation with NO_AVX=1 set
fixes #1974
7 years ago
Martin Kroeker
bbfdd6c0fe
Increase Zen SWITCH_RATIO to 16
following GEMM benchmarks on Ryzen2700X. For #1464
7 years ago
Martin Kroeker
32b0f1168e
Fix declaration of input arguments in the Sandybridge GER microkernels ( #1967 )
* Tag arguments 0 and 1 as both input and output
7 years ago
Martin Kroeker
b495e54310
Fix declaration of input arguments in the x86_64 SCAL microkernels ( #1966 )
* Tag arguments 0 and 1 as both input and output (see #1964 )
7 years ago
Martin Kroeker
d5e6940253
Fix declaration of input arguments in the x86_64 microkernels for DOT and AXPY ( #1965 )
* Tag operands 0 and 1 as both input and output
For #1964 (basically a continuation of coding problems first seen in #1292 )
7 years ago
Martin Kroeker
24e697eadb
Merge pull request #1970 from quickwritereader/develop
crot fix
7 years ago
Martin Kroeker
3e9fd6359d
Bump xcode version to 10.1 to make sure it handles AVX512
7 years ago
Ubuntu
43a4572038
crot fix
7 years ago
Martin Kroeker
256eb588bb
Merge pull request #1963 from quickwritereader/develop
Blas1 single missing kernels implemented with vector builtins
7 years ago
Abdelrauf
a034e65512
Merge branch 'develop' into develop
7 years ago
Ubuntu
8c3386be87
Added missing Blas1 single fp {saxpy, caxpy, cdot, crot(refactored version of srot),isamax ,isamin, icamax, icamin},
Fixed idamin,icamin choosing the first occurance index of equal minimals
7 years ago
Martin Kroeker
1e3ada6db4
Merge pull request #1960 from cnjsdfcy/Hygon
Add support for Hygon Dhyana
7 years ago
caiyu
29dc72889f
Add support for Hygon Dhyana
7 years ago
Martin Kroeker
dbc9a060ef
Fix missing braces in support_av() call
7 years ago
Martin Kroeker
00401489c2
Fix missing braces in support_avx()
7 years ago
Martin Kroeker
21c0f2af7b
Merge pull request #1957 from martin-frbg/issue1954
Move TLS key deletion to openblas_quit
7 years ago
Martin Kroeker
ad2c386d6a
Move TLS key deletion to openblas_quit
fixes #1954 (as suggested by thrasibule in that issue)
7 years ago
Martin Kroeker
8d99dba86b
Merge pull request #1949 from martin-frbg/issue1947
Query AVX2 and AVX512VL support when selecting x86 kernels
7 years ago
Martin Kroeker
1650311246
Bump xcode to 8.3
7 years ago
Martin Kroeker
cf5d48e833
Update OSX environment to Sierra
as homebrew seems to have dropped support for El Capitan in their gcc packages
7 years ago
Martin Kroeker
191677b902
Add travis_wait to the OSX brew install phase
7 years ago
Martin Kroeker
31ed19e8b9
Add message for SkylakeX and KNL fallbacks to Haswell
7 years ago
Martin Kroeker
e1574fa2b4
Add xcr0 (os support) check
7 years ago
Martin Kroeker
68eb3146ce
Add xcr0 (os support) check
7 years ago
Martin Kroeker
0afaae4b23
Query AVX2 and AVX512VL capability in x86 cpu detection
7 years ago
Martin Kroeker
ae1d1f74f7
Query AVX2 and AVX512 capability for runtime cpu selection
7 years ago
Martin Kroeker
ed01f4932a
Merge pull request #1946 from martin-frbg/issue1908
More fixes for cross-compiling ARM64 targets
7 years ago
Martin Kroeker
802f0dbde1
More fixes for cross-compiling ARM64 targets
Fixed core naming for DYNAMIC_ARCH. Corrected GEMM_DEFAULT entries and added SYMV_P. Replaced outdated VULCAN define for ThunderX2T99 with ARMV8 to get basic definitions back. For issue #1908
7 years ago
Martin Kroeker
20d1aad13f
Fix missing quotes around thunderx targets
7 years ago
TiborGY
d11554c88f
Validate user supplied TARGET ( #1941 )
the build will now abort with an error message when an undefined build TARGET is named
Fixes #1938
7 years ago
Martin Kroeker
ed704185ab
Increment version to 0.3.6.dev
7 years ago
Martin Kroeker
2940798ea7
Increment version to 0.3.6.dev
7 years ago
Martin Kroeker
1c75b65d53
Merge branch 'release-0.3.0' into develop
7 years ago
Martin Kroeker
13d006339b
Update ChangeLog.txt with changes from 0.3.5
7 years ago
Martin Kroeker
bf76162635
Merge pull request #1944 from hartzell/patch-1
Typo: Skyalke -> Skylake
7 years ago
George Hartzell
0d52aefc6b
Typo: Skyalke -> Skylake
Worth fixing, it gets in the way of searching....
7 years ago
Martin Kroeker
a6787b0f81
Merge pull request #1939 from TiborGY/patch-2
Fix typo in UNKNOWN core name
7 years ago
Martin Kroeker
8643521127
Merge pull request #1943 from martin-frbg/issue1748
Re-enable loop unrolling in trmv and remove the scary warning
7 years ago
Martin Kroeker
5a720cf9ca
Re-enable loop unrolling in trmv and remove the scary warning
fixes #1748 as that half of the fix for #1332 appears to have been an overreaction on my part.
7 years ago
Martin Kroeker
ccd5945d38
Merge pull request #1942 from martin-frbg/issue1720
Delete the pthread key on cleanup in TLS mode
7 years ago
Martin Kroeker
9f80e0f5fc
Remove stray include of complex.h
already provided conditionally by common.h via openblas_utest.h
Unconditional inclusion breaks older Android and similar platforms that use OPENBLAS_COMPLEX_STRUCT
7 years ago
Martin Kroeker
bba1e67269
Delete the pthread key on cleanup in TLS mode
to avoid a crash when OpenBLAS was loaded via dlopen and libc tries to clean up the leaked TLS after dlclose
Fixes #1720
7 years ago
Martin Kroeker
93240f489e
Fix wrong case in TARGET setting for Alpine
7 years ago
TiborGY
7cbc2c37d6
Update cpuid_mips64.c
7 years ago
TiborGY
c329de2931
Update Makefile
7 years ago
TiborGY
187233953c
Update cpuid_mips.c
7 years ago
TiborGY
09170268a3
Update cpuid_arm.c
7 years ago
TiborGY
211120c508
Fix typo in UNKNOWN core name
Should be of no consequence, right?
7 years ago