AOCL-BLIS 3.2
New features:
- Extended BLAS function - DZGEMM
- Progress feature for xGEMM and xTRSM APIs: Time taken to complete the mathematical operations tends to increase exponentially with large input problem sizes; this feature provides users a periodic update on the operation progress.
- Runtime Threading control using OpenMP APIs
- Dynamic Dispatch covers APUs
- Improved detection of standard x86-64 feature support
- Minor bug fixes
Performance improvements in the following single-threaded and multi-threaded functions:
- DGEMM, SGEMM, ZGEMM, and CGEMM
- DTRSM, DGEMMT, ZTRSM, CTRSM, and DTRMM
- SGEMV, DHER2, ZTRSV, and DSYMV
- ?AXPBYV, SSCALV, DSCALV, ?DOTXV, and ZAXPY2V