Skip to content

AMD Optimized BLIS Version 3.0

Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 16 Mar 05:25
· 1668 commits to master since this release
2728fc8

AMD Optimized BLIS Version 3.0

Highlights of improvements on AMD EPYCTM processor family CPUs

  • Includes support for AMD’s Zen3 architecture. Build can auto detect if it is running on zen3 and enable features and optimizations specific to zen3 architecture.
  • Improved performance of ?dotv, ?gemv, ?axpyv for complex and double complex datatypes
  • Includes support for copy transposition routines
  • New BLAS extension APIs added including cblas_?cabs1, cblas_i?amin, cblas_?axpby, cblas_?gemm_batch, cblas_?gemm3m
  • Debug trace and input logging support added for more BLIS APIs.