AOCL-BLAS

AOCL-BLAS provides a high-performance implementation of the Basic Linear Algebra Subprograms (BLAS), which provide the essential kernels of matrix and vector computation—among the most used computationally intensive operations in dense numerical linear algebra. Select kernels have been optimized for the AMD “Zen”-based processors, including AMD EPYC™, AMD Ryzen™, and AMD Ryzen™ Threadripper™ processors.

AOCL-BLAS is developed as a forked version of BLIS (https://github.com/flame/blis), originally developed by members of the Science of High-Performance Computing (SHPC) group in the Institute for Computational Engineering and Sciences at The University of Texas at Austin and other collaborators (including AMD). The library retains all known BLIS features and adds:

  • Standard BLAS and CBLAS interfaces
  • C++ template interfaces for BLAS functionalities 

Highlights of AOCL-BLAS 5.2

  • Performance optimizations on Zen4, Zen5 architectures
    • DGEMV, DGEMM, ZGEMM, CGEMM, DCOPY
  • Multi-threading and General Stride support for DGEMV
  • Numerical precision improvement for ZGEMV
  • Support for the complete set of GEMMTR APIs
  • Build options to disable certain code paths in GEMM and TRSM
  • CMake and configure options to enable DTL Logging
  • Updates to aocl-gemm add-on module
    • Column Major support for BF16 and FP32
    • FP32 RD kernels for AVX512 and AVX2 ISA
    • GEMV kernel for m=1 case using AVX2 and AVX512 YMM registers
  • You can find the package containing AOCL-BLAS Library binaries that includes optimizations for AMD processors, examples, and documentation in the Downloads section.

Documentation

AOCL-LAPACK

AOCL-LAPACK is a high-performance implementation of Linear Algebra PACKage (LAPACK), which provides routines for solving systems of linear equations, least-squares problems, eigenvalue problems, singular value problems, and the associated matrix factorizations. Extensible, easy to use, and available under an open-source license, AOCL-LAPACK  can be utilized by applications relying on standard Netlib LAPACK interfaces with virtually no changes to their source code. AOCL-LAPACK supports C, Fortran, and C++ template interfaces (for a subset of APIs) for the LAPACK APIs.

AOCL-LAPACK is compatible with the LAPACK 3.12.0 specification. Combined with the AOCL-BLAS library, which includes optimizations for the AMD “Zen”-based processors, AOCL-LAPACK enables running high performing LAPACK functionalities on AMD platforms.

Highlights of AOCL-LAPACK 5.2

  • Improved performance of the following routines
    • LU, Cholesky and QR Factorizations (DGETRF, DPOTRF & DGEQRF)
    • Symmetric Eigen Decomposition (DSYEVD)
    • Matrix Inverse routines (DGETRI & DPOTRI) for small sizes
  • Build System update
    • AOCL-LAPACK now supports only CMAKE based build; autoconf based build is not supported anymore
    • Added two new modes (avx2-strict, avx512-strict) under LF_ISA_CONFIG build flag to enforce specific ISA during execution
  • Test suite framework enhancements
    • Bit Reproducibility tests for test-suite supported LAPACK APIs
    • Introduced Benchmark Mode to run tests for fixed duration, displaying additional metrics, and more

Documentation

Downloads

File Name Version Size Launch Date OS Bitness Description
Binary packages compiled with AOCC 5.1
aocl-blis-linux-aocc-5.2.0.tar.gz 5.2 39MB 12/31/2025 RHEL, Ubuntu, SLES 64-bit AOCC compiled AOCL-BLAS library binary package
SHA-256 checksum: db7925c26f085f94ff5054939968b56547552efc1f15235f933f5b9587061ae7
aocl-libflame-linux-aocc-5.2.0.tar.gz 5.2 30MB 12/31/2025 RHEL, Ubuntu, SLES 64-bit AOCC compiled AOCL-LAPACK Library binary package
SHA-256 checksum: 8360919226b5bd1aac6b93a39ca9452b51a0112a1d3112ad26763454cadc7e1d
Binary packages compiled with GCC 14.2.1
aocl-blis-linux-gcc-5.2.0.tar.gz 5.2 53MB 12/31/2025 RHEL, Ubuntu, SLES 64-bit GCC compiled AOCL-BLAS library binary package
SHA-256 checksum: 0c7fbe06057c42e7591534da558f1e3176c0f401d55d731f62ce9c11fa4d70da
aocl-libflame-linux-gcc-5.2.0.tar.gz 5.2 33MB 12/31/2025 RHEL, Ubuntu, SLES 64-bit GCC compiled AOCL- LAPACK Library binary package
SHA-256 checksum: b6b014309d5f8433a49de3f2ad05ce1144e340eef649156edd73da20ba9aa194
Windows Installer Compiled with Clang 18
AOCL_Windows-setup-5.2.0-AMD.exe 5.2 154MB 12/31/2025 Windows 11, Windows 10 64-bit Windows installer file containing all the AOCL library binaries compiled with Clang 18.
SHA-256 checksum: 0caf97420a5e1372aab6c70160f977f4a36c26cee89b88856145af82df2fe096