NEW! AOCL 5.1 is now available, May 7, 2025
AOCL is a set of numerical libraries optimized for AMD processors based on the AMD “Zen” core architecture and generations. Supported processor families are AMD EPYC™, AMD Ryzen™, and AMD Ryzen™ Threadripper™ processors. The tuned implementations of industry-standard math libraries enable rapid development of scientific and high-performance computing applications.
Libraries
- AOCL-Data Analytics
- AOCL-BLAS
- AOCL-LAPACK
- AOCL-Compression
- AOCL-Cryptography
- AOCL-FFTW (Fastest Fourier Transform in the West)
- AOCL-LibM (AMD Math Library)
- AOCL-LibMem
- AOCL-RNG (AMD Random Number Generator Library)
- AOCL-SecureRNG (Secure RNG Library)
- AOCL-ScaLAPACK
- AOCL-Sparse
- AOCL-Utils
What’s new in AOCL 5.1 (May 7, 2025)
- AOCL Build-It-Yourself (Beta)
- AOCL now offers the capability to compile individual libraries and consolidate them into a unified binary. With the Build-It-Yourself feature, you can choose one or more AOCL libraries and merge them into a single library by configuring the appropriate CMake options.
- This unified binary is assigned a default name: libaocl.so/ libaocl.a for Linux and aocl.dll/ aocl.lib for Windows
- This approach simplifies integration by eliminating dependencies on library linking order and preventing API duplication, ensuring smooth and efficient incorporation of multiple AOCL libraries.
- AOCL-Data Analytics
- New APIs for DBSCAN clustering, support vector machines (SVM) and pairwise distances.
- Dynamic dispatch: multiple Zen generation code paths compiled in a single binary, with appropriate code path selected at runtime.
- Performance improvements to decision trees, random forest and k-nearest neighbors algorithms.
- AOCL-BLAS
- Performance Optimizations
- DGEMM, DTRSM, DGEMV, ZGEMM, DTRSV, DCOPYV on Zen4/5
- DSCALV, DDOTV on Zen3
- Benchmark support for ASUMV
- Aocl-gemm Add-on Module updates
- AOCL_ENABLE_INSTRUCTIONS support
- batch_gemm support for all data types
- New Output Datatype for Integer APIs
- BF16 Support on AVX2 Platforms
- WOQ with/without Group Quantization
- Threading Framework Optimizations
- Reference Kernels for all Reorder APIs
- Performance Optimizations for all APIs
- Additional APIs and Post-Ops support in addition to the improved performance for the existing APIs in aocl_gemm add-on.
- Performance Optimizations
- AOCL-Compression
- Introduced multithreaded APIs for GZIP and Raw Deflate formats.
- Improved single threaded compression performance for Bzip2, Snappy and decompression performance for Bzip2, Snappy, Gzip, ZSTD.
- AOCL_DECOMPRESS_FAST support extended to Snappy.
- Integrated third party utilities: minigzip and zstd utility for benchmarking.
- AOCL-Cryptography
- Performance improvements in SHA3.
- AES-GCM optimizations for real world applications.
- OpenSSL provider support for SHA3 and SHAKE algorithms.
- Runtime forcing of CPUID / micro arch code path support using aocl-utils
- CMake preset support.
- Bench and test framework improvements.
- Coverity high and majority of medium severity defects fixed.
- AOCL-FFTW
- Same as the 5.0 release.
- AOCL-LAPACK
- Performance improvements in the following routines:
- LU Factorizations (DGETRF & DGBTRF), Least Square Solver (ZGELS)
- SVD (DGESDD) and Eigen Decomposition (DSYEVD)
- Upgrade to Netlib LAPACK 3.12.0 specifications
- Updates to CMAKE presets, pkg-config for ease of build and integration
- Test cases added for C++ template interfaces
- AOCL-LibM
- Added new vector variants of sincos.
- Improved performance of vector variants of linearfrac.
- Added CMAKE support (experimental) for building AOCL-LibM binaries on Linux and Windows.
- Added static dispatch functionality while building with CMAKE.
- Updated dynamic dispatch to route to Zen3 versions of APIs on Zen5 with AVX512 ISA disabled.
- AOCL-LibMem
- Optimized memory functions for ZEN5 microarchitecture.
- Optimized string functions for AVX512 and AVX2
- Enabled support for Dynamic Dispatcher
- Support for new string functions: strchr, strncat
- Bench and Test Framework Improvements
- AOCL-RNG
- AVX512 support for double precision MRG32K3A Generator.
- Updated the AOCL-SecureRNG library to adjust the retrial count.
- AOCL-ScaLAPACK
- Improved PDSYEVD performance for flat grid configurations
- pkgconfig support for better integration with AOCL dependencies
- AOCL-Sparse
- Multithreaded support for CSRMM, SpMV Transposed
- Performance improvements:
- Level 2: SpMV variants (Hermitian, Symmetric, Transposed)
- Level 3: CSRMM
- Enable dynamic dispatch
- Support for complex data types in iterative methods, ILU0
- Support for CSC storage format in TRSV
- Framework improvements
- Build system enhancements:
- Enable CMake presets
- Allow separate build of tests
- AOCL-Utils
- Implemented Hardware and System Rng.
- Implemented extendable and flexible thread safe logger.
- Support for isZenFamily.
- AOCL-Enabled MUMPS Library
- Building for Windows 10 systems: CMake-based build system on AOCL-enabled MUMPS sparse solver library; refer to GitHub MUMPS-build (https://github.com/amd/mumps-build) for more information.
- Mumps 5.5.1 support with 5.5.0 and 5.4.1 backward compatibility.
Resources and Technical Support
Documentation
AOCL Build-It-Yourself Source code: GitHub
For prior versions of AOCL, refer to AOCL Archive.
Support
For support options, refer to Technical Support.
AMD Community
For moderated forums, refer to the AMD community.
Download with End User License Agreement
File Name | Version | Size | Launch Date | OS | Bitness | Description |
AOCL 5.0 binary packages compiled with AOCC 5.0 | ||||||
aocl-linux-aocc-5.0.0.tar.gz | 5.0 | 105MB | 10/10/2024 | RHEL, Ubuntu, SLES | 64-bit | AOCC compiled AOCL tar file containing all the library binaries. It includes install.sh file that extracts and installs the libraries. SHA-256 checksum: 604ae6c34ef2648504ba08d2f33e42b5a9cf4032eaefaa9253560747ebaeb1ca |
aocl-linux-aocc-5.0.0_1_amd64.deb | 5.0 | 72MB | 10/10/2024 | Ubuntu | 64-bit | AOCC compiled Debian package SHA-256 checksum: f397a03c92525513b298aba949c9cffcf458ff640e6e7eb0a0c6782827d55168 |
aocl-linux-aocc-5.0.0-1.x86_64.rpm | 5.0 | 84MB | 10/10/2024 | RHEL, SLES | 64-bit | AOCC compiled RPM package SHA-256 checksum: 44127f2df32b9a946832cf73846cdd35c63771c11f0527d44d1e88babac8b054 |
AOCL 5.1 binary packages compiled with GCC 14.2.1 | ||||||
aocl-linux-gcc-5.1.0.tar.gz | 5.1 | 143MB | 05/07/2025 | RHEL, Ubuntu, SLES | 64-bit | GCC compiled AOCL tar file containing all the library binaries. Includes install.sh file that extracts and installs the libraries. SHA-256 checksum: 73ad28baaa1096972804c1a1d42b8ed4724c0b98b60806e94ef6322bc86afa8b |
aocl-linux-gcc-5.1.0_1_amd64.deb | 5.1 | 87MB | 05/07/2025 | Ubuntu | 64-bit | GCC compiled Debian package SHA-256 checksum: b27be8e8f564ee3417438748bc281b5398155c361dc9633af83c0e9fc94b8784 |
aocl-linux-gcc-5.1.0-1.x86_64.rpm | 5.1 | 106MB | 05/07/2025 | RHEL, SLES | 64-bit | GCC compiled RPM package SHA-256 checksum: 2641a2efacce40721cd372b77ca1ebdc98cfcae78b17b9253aa149e4c4baeeb1 |
Windows | ||||||
AOCL_Windows-setup-5.1.0.408-AMD.exe | 5.1 | 140MB | 05/07/2025 | Windows 11, Windows 10 | 64-bit | Windows installer file containing all the AOCL library binaries compiled with Clang 18. SHA-256 checksum: 61de98148459270ba3bd01f5b0c409299d8c0c144a925f3701e10f72678a328a |