Skip to content

AMD Optimized BLIS Version 2.1

Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 14 Jan 04:35
· 1913 commits to master since this release

AMD Optimized BLIS Version 2.1

Highlights of improvements on AMD EPYCTM processor family CPUs

  • Improved performance of SGEMM and DGEMM for small and skinny size matrices
  • Improved TRSM single thread performance for small and skinny size matrices
  • BLIS build now supports both AMD "zen" and "zen2" configurations with auto config option
  • Support for C++ Template APIs for all BLAS functions