AOCL-BLAS
AOCL-BLAS provides a high-performance implementation of the Basic Linear Algebra Subprograms (BLAS), which provide the essential kernels of matrix and vector computation—among the most used computationally intensive operations in dense numerical linear algebra. Select kernels have been optimized for the AMD “Zen”-based processors, including AMD EPYC™, AMD Ryzen™, and AMD Ryzen™ Threadripper™ processors.
AOCL-BLAS is developed as a forked version of BLIS (https://github.com/flame/blis), originally developed by members of the Science of High-Performance Computing (SHPC) group in the Institute for Computational Engineering and Sciences at The University of Texas at Austin and other collaborators (including AMD). The library retains all known BLIS features and adds:
- Standard BLAS and CBLAS interfaces
- C++ template interfaces for BLAS functionalities
Highlights of AOCL-BLAS 5.2
- Performance optimizations on Zen4, Zen5 architectures
- DGEMV, DGEMM, ZGEMM, CGEMM, DCOPY
- Multi-threading and General Stride support for DGEMV
- Numerical precision improvement for ZGEMV
- Support for the complete set of GEMMTR APIs
- Build options to disable certain code paths in GEMM and TRSM
- CMake and configure options to enable DTL Logging
- Updates to aocl-gemm add-on module
- Column Major support for BF16 and FP32
- FP32 RD kernels for AVX512 and AVX2 ISA
- GEMV kernel for m=1 case using AVX2 and AVX512 YMM registers
- You can find the package containing AOCL-BLAS Library binaries that includes optimizations for AMD processors, examples, and documentation in the Downloads section.
Documentation
- Source code: GitHub.
AOCL-LAPACK
AOCL-LAPACK is a high-performance implementation of Linear Algebra PACKage (LAPACK), which provides routines for solving systems of linear equations, least-squares problems, eigenvalue problems, singular value problems, and the associated matrix factorizations. Extensible, easy to use, and available under an open-source license, AOCL-LAPACK can be utilized by applications relying on standard Netlib LAPACK interfaces with virtually no changes to their source code. AOCL-LAPACK supports C, Fortran, and C++ template interfaces (for a subset of APIs) for the LAPACK APIs.
AOCL-LAPACK is compatible with the LAPACK 3.12.0 specification. Combined with the AOCL-BLAS library, which includes optimizations for the AMD “Zen”-based processors, AOCL-LAPACK enables running high performing LAPACK functionalities on AMD platforms.
Highlights of AOCL-LAPACK 5.2
- Improved performance of the following routines
- LU, Cholesky and QR Factorizations (DGETRF, DPOTRF & DGEQRF)
- Symmetric Eigen Decomposition (DSYEVD)
- Matrix Inverse routines (DGETRI & DPOTRI) for small sizes
- Build System update
- AOCL-LAPACK now supports only CMAKE based build; autoconf based build is not supported anymore
- Added two new modes (avx2-strict, avx512-strict) under LF_ISA_CONFIG build flag to enforce specific ISA during execution
- Test suite framework enhancements
- Bit Reproducibility tests for test-suite supported LAPACK APIs
- Introduced Benchmark Mode to run tests for fixed duration, displaying additional metrics, and more
Documentation
- AOCL-LAPACK API Guide
- Prior versions: AOCL-BLAS and AOCL-LAPACK Library Archive
- Source code: GitHub
Downloads
| File Name | Version | Size | Launch Date | OS | Bitness | Description |
| Binary packages compiled with AOCC 5.1 | ||||||
| aocl-blis-linux-aocc-5.2.0.tar.gz | 5.2 | 39MB | 12/31/2025 | RHEL, Ubuntu, SLES | 64-bit | AOCC compiled AOCL-BLAS library binary package SHA-256 checksum: db7925c26f085f94ff5054939968b56547552efc1f15235f933f5b9587061ae7 |
| aocl-libflame-linux-aocc-5.2.0.tar.gz | 5.2 | 30MB | 12/31/2025 | RHEL, Ubuntu, SLES | 64-bit | AOCC compiled AOCL-LAPACK Library binary package SHA-256 checksum: 8360919226b5bd1aac6b93a39ca9452b51a0112a1d3112ad26763454cadc7e1d |
| Binary packages compiled with GCC 14.2.1 | ||||||
| aocl-blis-linux-gcc-5.2.0.tar.gz | 5.2 | 53MB | 12/31/2025 | RHEL, Ubuntu, SLES | 64-bit | GCC compiled AOCL-BLAS library binary package SHA-256 checksum: 0c7fbe06057c42e7591534da558f1e3176c0f401d55d731f62ce9c11fa4d70da |
| aocl-libflame-linux-gcc-5.2.0.tar.gz | 5.2 | 33MB | 12/31/2025 | RHEL, Ubuntu, SLES | 64-bit | GCC compiled AOCL- LAPACK Library binary package SHA-256 checksum: b6b014309d5f8433a49de3f2ad05ce1144e340eef649156edd73da20ba9aa194 |
| Windows Installer Compiled with Clang 18 | ||||||
| AOCL_Windows-setup-5.2.0-AMD.exe | 5.2 | 154MB | 12/31/2025 | Windows 11, Windows 10 | 64-bit | Windows installer file containing all the AOCL library binaries compiled with Clang 18. SHA-256 checksum: 0caf97420a5e1372aab6c70160f977f4a36c26cee89b88856145af82df2fe096 |