NEW! AOCL 5.1 is now available, May 7, 2025

AOCL is a set of numerical libraries optimized for AMD processors based on the AMD “Zen” core architecture and generations. Supported processor families are AMD EPYC™, AMD Ryzen™, and AMD Ryzen™ Threadripper™ processors. The tuned implementations of industry-standard math libraries enable rapid development of scientific and high-performance computing applications.

Libraries

What’s new in AOCL 5.1 (May 7, 2025)
  • AOCL Build-It-Yourself (Beta)
    • AOCL now offers the capability to compile individual libraries and consolidate them into a unified binary. With the Build-It-Yourself feature, you can choose one or more AOCL libraries and merge them into a single library by configuring the appropriate CMake options.
    • This unified binary is assigned a default name: libaocl.so/ libaocl.a for Linux and aocl.dll/ aocl.lib for Windows
    • This approach simplifies integration by eliminating dependencies on library linking order and preventing API duplication, ensuring smooth and efficient incorporation of multiple AOCL libraries.
  • AOCL-Data Analytics 
    • New APIs for DBSCAN clustering, support vector machines (SVM) and pairwise distances.
    • Dynamic dispatch: multiple Zen generation code paths compiled in a single binary, with appropriate code path selected at runtime.
    • Performance improvements to decision trees, random forest and k-nearest neighbors algorithms.
  • AOCL-BLAS
    • Performance Optimizations
      • DGEMM, DTRSM, DGEMV, ZGEMM, DTRSV, DCOPYV on Zen4/5
      • DSCALV, DDOTV on Zen3
    • Benchmark support for ASUMV
    • Aocl-gemm Add-on Module updates
      • AOCL_ENABLE_INSTRUCTIONS support
      • batch_gemm support for all data types
      • New Output Datatype for Integer APIs
      • BF16 Support on AVX2 Platforms
      • WOQ with/without Group Quantization
      • Threading Framework Optimizations
      • Reference Kernels for all Reorder APIs
      • Performance Optimizations for all APIs
      • Additional APIs and Post-Ops support in addition to the improved performance for the existing APIs in aocl_gemm add-on.
  • AOCL-Compression
    • Introduced multithreaded APIs for GZIP and Raw Deflate formats.
    • Improved single threaded compression performance for Bzip2, Snappy and decompression performance for Bzip2, Snappy, Gzip, ZSTD.
    • AOCL_DECOMPRESS_FAST support extended to Snappy.
    • Integrated third party utilities: minigzip and zstd utility for benchmarking.
  • AOCL-Cryptography
    • Performance improvements in SHA3.
    • AES-GCM optimizations for real world applications.
    • OpenSSL provider support for SHA3 and SHAKE algorithms.
    • Runtime forcing of CPUID / micro arch code path support using aocl-utils
    • CMake preset support.
    • Bench and test framework improvements.
    • Coverity high and majority of medium severity defects fixed.
  • AOCL-FFTW
    • Same as the 5.0 release.
  • AOCL-LAPACK
    • Performance improvements in the following routines:
      • LU Factorizations (DGETRF & DGBTRF), Least Square Solver (ZGELS)
      • SVD (DGESDD) and Eigen Decomposition (DSYEVD)
    • Upgrade to Netlib LAPACK 3.12.0 specifications
    • Updates to CMAKE presets, pkg-config for ease of build and integration
    • Test cases added for C++ template interfaces
  • AOCL-LibM
    • Added new vector variants of sincos.
    • Improved performance of vector variants of linearfrac.
    • Added CMAKE support (experimental) for building AOCL-LibM binaries on Linux and Windows.
    • Added static dispatch functionality while building with CMAKE.
    • Updated dynamic dispatch to route to Zen3 versions of APIs on Zen5 with AVX512 ISA disabled.
  • AOCL-LibMem
    • Optimized memory functions for ZEN5 microarchitecture.
    • Optimized string functions for AVX512 and AVX2
    • Enabled support for Dynamic Dispatcher
    • Support for new string functions: strchr, strncat
    • Bench and Test Framework Improvements
  • AOCL-RNG
    • AVX512 support for double precision MRG32K3A Generator.
    • Updated the AOCL-SecureRNG library to adjust the retrial count.
  • AOCL-ScaLAPACK
    • Improved PDSYEVD performance for flat grid configurations
    • pkgconfig support for better integration with AOCL dependencies
  • AOCL-Sparse
    • Multithreaded support for CSRMM, SpMV Transposed
    • Performance improvements:
      • Level 2: SpMV variants (Hermitian, Symmetric, Transposed)
      • Level 3: CSRMM
    • Enable dynamic dispatch
    • Support for complex data types in iterative methods, ILU0
    • Support for CSC storage format in TRSV
    • Framework improvements
    • Build system enhancements:
      • Enable CMake presets
      • Allow separate build of tests
  • AOCL-Utils
    • Implemented Hardware and System Rng.
    • Implemented extendable and flexible thread safe logger.
    • Support for isZenFamily.
  • AOCL-Enabled MUMPS Library
    • Building for Windows 10 systems: CMake-based build system on AOCL-enabled MUMPS sparse solver library; refer to GitHub MUMPS-build (https://github.com/amd/mumps-build) for more information.
    • Mumps 5.5.1 support with 5.5.0 and 5.4.1 backward compatibility.

Resources and Technical Support

Documentation

AOCL User Guide

AOCL Tuning Guide

AOCL Release Notes

AOCL API Guide

AOCL Build-It-Yourself Source code: GitHub

For prior versions of AOCL, refer to AOCL Archive.

Support

For support options, refer to Technical Support.

AMD Community

For moderated forums, refer to the AMD community.

Download with End User License Agreement

File Name Version Size Launch Date OS Bitness Description
AOCL 5.0 binary packages compiled with AOCC 5.0
aocl-linux-aocc-5.0.0.tar.gz 5.0 105MB 10/10/2024 RHEL, Ubuntu, SLES 64-bit AOCC compiled AOCL tar file containing all the library binaries. It includes install.sh file that extracts and installs the libraries.
SHA-256 checksum: 604ae6c34ef2648504ba08d2f33e42b5a9cf4032eaefaa9253560747ebaeb1ca
aocl-linux-aocc-5.0.0_1_amd64.deb 5.0 72MB 10/10/2024 Ubuntu 64-bit AOCC compiled Debian package
SHA-256 checksum: f397a03c92525513b298aba949c9cffcf458ff640e6e7eb0a0c6782827d55168
aocl-linux-aocc-5.0.0-1.x86_64.rpm 5.0 84MB 10/10/2024 RHEL, SLES 64-bit AOCC compiled RPM package
SHA-256 checksum: 44127f2df32b9a946832cf73846cdd35c63771c11f0527d44d1e88babac8b054
AOCL 5.1 binary packages compiled with GCC 14.2.1
aocl-linux-gcc-5.1.0.tar.gz 5.1 143MB 05/07/2025 RHEL, Ubuntu, SLES 64-bit GCC compiled AOCL tar file containing all the library binaries. Includes install.sh file that extracts and installs the libraries.
SHA-256 checksum: 73ad28baaa1096972804c1a1d42b8ed4724c0b98b60806e94ef6322bc86afa8b
aocl-linux-gcc-5.1.0_1_amd64.deb 5.1 87MB 05/07/2025 Ubuntu 64-bit GCC compiled Debian package
SHA-256 checksum: b27be8e8f564ee3417438748bc281b5398155c361dc9633af83c0e9fc94b8784
aocl-linux-gcc-5.1.0-1.x86_64.rpm 5.1 106MB 05/07/2025 RHEL, SLES 64-bit GCC compiled RPM package
SHA-256 checksum: 2641a2efacce40721cd372b77ca1ebdc98cfcae78b17b9253aa149e4c4baeeb1
Windows
AOCL_Windows-setup-5.1.0.408-AMD.exe 5.1 140MB 05/07/2025 Windows 11, Windows 10 64-bit Windows installer file containing all the AOCL library binaries compiled with Clang 18.
SHA-256 checksum: 61de98148459270ba3bd01f5b0c409299d8c0c144a925f3701e10f72678a328a