| 1 | 1 | OpenBLAS ChangeLog |
| 2 | +==================================================================== |
| 3 | +Version 0.3.7 |
| 4 | +11-Aug-2019 |
| 5 | + |
| 6 | +common: |
| 7 | + * having the gmake special variables TARGET_ARCH or TARGET_MACH |
| 8 | + defined no longer causes build failures in ctest or utest |
| 9 | + * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer |
| 10 | + has the same effect as setting them to 1 |
| 11 | + * a new test program was added to allow checking the library for |
| 12 | + thread safety |
| 13 | + * a new option USE_LOCKING was added to ensure thread safety when |
| 14 | + OpenBLAS itself is built without multithreading but will be |
| 15 | + called from multiple threads. |
| 16 | + * a build failure on Linux with glibc versions earlier than 2.5 |
| 17 | + was fixed |
| 18 | + * a runtime error with CPU enumeration (and NO_AFFINITY not set) |
| 19 | + on glibc 2.6 was fixed |
| 20 | + * NO_AFFINITY was added to the CMAKE options (and defaults to being |
| 21 | + active on Linux, as in the gmake builds) |
| 22 | + |
| 23 | +x86_64: |
| 24 | + * the build-time logic for detection of AVX512 availability in |
| 25 | + the processor and compiler was fixed |
| 26 | + * gmake builds on OSX now set the internal name of the library to |
| 27 | + libopenblas.0.dylib (consistent with CMAKE) |
| 28 | + * the Haswell DGEMM kernel received a significant speedup through |
| 29 | + improved prefetch and load instructions |
| 30 | + * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly |
| 31 | + increased by avoiding vpermpd instructions |
| 32 | + * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled |
| 33 | + to fix remaining errors in DGEMM, DSYMM and DTRMM |
| 34 | + |
| 35 | +POWER: |
| 36 | + * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970 |
| 37 | + * added optimized kernels for POWER9 SGEMM and STRMM |
| 38 | + |
| 39 | +ARMV7: |
| 40 | + * fixed the softfp implementations of xAMAX and IxAMAX |
| 41 | + * removed the predefined -march= flags on both ARMV5 and ARMV6 as |
| 42 | + they were appropriate for only a subset of platforms |
| 43 | + |
| 2 | 44 | ==================================================================== |
| 3 | 45 | Version 0.3.6 |
| 4 | 46 | 29-Apr-2019 |