Skip to content

AOCL-BLIS 3.2

Compare
Choose a tag to compare
@dzambare dzambare released this 09 Jul 03:01
· 1094 commits to master since this release

New features:

  • Extended BLAS function - DZGEMM
  • Progress feature for xGEMM and xTRSM APIs: Time taken to complete the mathematical operations tends to increase exponentially with large input problem sizes; this feature provides users a periodic update on the operation progress.
  • Runtime Threading control using OpenMP APIs
  • Dynamic Dispatch covers APUs
  • Improved detection of standard x86-64 feature support
  • Minor bug fixes

Performance improvements in the following single-threaded and multi-threaded functions:

  • DGEMM, SGEMM, ZGEMM, and CGEMM
  • DTRSM, DGEMMT, ZTRSM, CTRSM, and DTRMM
  • SGEMV, DHER2, ZTRSV, and DSYMV
  • ?AXPBYV, SSCALV, DSCALV, ?DOTXV, and ZAXPY2V