| @@ -1,4 +1,47 @@ | |||||
| OpenBLAS ChangeLog | OpenBLAS ChangeLog | ||||
| ==================================================================== | |||||
| Version 0.3.7 | |||||
| 11-Aug 2019 | |||||
| common: | |||||
| * having the gmake special variables TARGET_ARCH or TARGET_MACH | |||||
| defined no longer causes build failures in ctest or utest | |||||
| * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer | |||||
| has the same effect as setting them to 1 | |||||
| * a new test program was added to allow checking the library for | |||||
| thread safety | |||||
| * a new option USE_LOCKING was added to ensure thread safety when | |||||
| OpenBLAS itself is built without multithreading but will be | |||||
| called from multiple threads. | |||||
| * a build failure on Linux with glibc versions earlier than 2.5 | |||||
| was fixed | |||||
| * a runtime error with CPU enumeration (and NO_AFFINITY not set) | |||||
| on glibc 2.6 was fixed | |||||
| * NO_AFFINITY was added to the CMAKE options (and defaults to being | |||||
| active on Linux, as in the gmake builds) | |||||
| x86_64: | |||||
| * the build-time logic for detection of AVX512 availability in | |||||
| the processor and compiler was fixed | |||||
| * gmake builds on OSX now set the internal name of the library to | |||||
| libopenblas.0.dylib (consistent with CMAKE) | |||||
| * the Haswell DGEMM kernel received a significant speedup through | |||||
| improved prefetch and load instructions | |||||
| * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly | |||||
| increased by avoiding vpermpd instructions | |||||
| * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled | |||||
| to fix remaining errors in DGEMM, DSYMM and DTRMM | |||||
| ## POWER: | |||||
| * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970 | |||||
| * added optimized kernels for POWER9 single and double precision complex BLAS3 | |||||
| * added optimized kernels for POWER9 SGEMM and STRMM | |||||
| ## ARMV7: | |||||
| * fixed the softfp implementations of xAMAX and IxAMAX | |||||
| * removed the predefined -march= flags on both ARMV5 and ARMV6 as | |||||
| they were appropriate for only a subset of platforms | |||||
| ==================================================================== | ==================================================================== | ||||
| Version 0.3.6 | Version 0.3.6 | ||||
| 29-Apr-2019 | 29-Apr-2019 | ||||