AOCL-BLAS 4.1
AOCL-BLAS 4.1 Release Highlights
- Additional APIs and Post-Ops support in addition to the improved performance for the existing APIs in aocl_gemm add-on
- Dynamic dispatch and amdzen configuration support added to aocl_gemm add-on
- Dynamic dispatch feature enhancements.
- AVX 512-based optimizations for AMD “Zen4” platform:
- SGEMM, DGEMM, and ZGEMM
- DTRSM, D/ZAXPY, ZGEMV, DDOTV, and D/ZSCALV - Improved support for OpenMP nested parallelism.