Breaking Barriers in Genomics: Slorado Brings Open-Source Genomic Basecalling to AMD GPUs

Nov 24, 2025

DNA Strand Next to AMD Device on PCB

UNSW Sydney, Pawsey, and AMD Collaborate to Democratize DNA Analysis

DNA sequencing is entering a new era of speed and openness. Researchers at UNSW Sydney and AMD Research and Advanced Development (RAD) have jointly developed Slorado, the world’s first fully open-source tool for real-time nanopore DNA decoding. Built on AMD GPUs and the ROCm™ open-source platform, Slorado delivers high-performance, scalable basecalling to the genomics community, enabling scientists worldwide to analyze nanopore sequencing data on any GPU hardware. Slorado’s performance and scalability were demonstrated on the Setonix supercomputer at the Pawsey Supercomputing Research Centre, Australia.

Accelerating Genomics with Open Compute

Nanopore sequencing is transforming genomics with affordable, portable, and real-time DNA analysis. Devices from Oxford Nanopore Technologies (ONT) generate continuous electrical signals, or squiggles, that an AI model decodes into DNA bases (A, C, G, T). This decoding process is known as basecalling. ONT’s open-source Dorado basecaller has driven significant advances in this space, but until now, it has relied on closed-source NVIDIA components, limiting researchers to specific GPUs.

Slorado removes this barrier for researchers by offering a fully open-source alternative that supports the latest transformer-based basecalling models and executes efficiently on AMD Instinct™ and Radeon™ GPUs through ROCm. By delivering full transparency, portability, and performance, Slorado gives researchers the freedom to deploy nanopore workflows across diverse computing environments, from supercomputers to local GPU workstations.

Demonstrated on Australia’s Flagship AMD-Powered Supercomputer

The Setonix supercomputer at the Pawsey Supercomputing Research Centre, powered by AMD Instinct MI250X GPUs, served as the primary testbed for Slorado’s performance validation.

Setonix Supercomputer
Setonix supercomputer at the Pawsey Supercomputing Research Centre.

A typical nanopore sequencing run of a human genome generates approximately 1 terabyte of raw signal data over 48 hours. By leveraging multi-node scaling on Setonix, researchers can process these datasets in parallel, dramatically accelerating time-to-insight and enabling large-scale genomic studies. Slorado processed a human genome dataset in as little as 2.3 hours, dramatically faster than traditional workflows.

Total Basecalling Time for a Complete HG002 ONT PromethION Genomic DNA Dataset
Figure 1: Total basecalling time (hours) for the HG002 ONT PromethION dataset using Slorado on AMD Instinct MI250X and AMD Instinct MI300X GPUs.

Figure 1 compares total basecalling time across AMD Instinct MI250X and AMD Instinct MI300X GPUs for the FAST, HAC, and SUP v5 models. On Pawsey’s Setonix system, which is powered by four AMD Instinct MI250X accelerators per node, Slorado achieved total basecalling times of 2.3 hours for the FAST model, 5.4 hours for the HAC model, and 15.4 hours for the computationally intensive SUP model when processing a complete HG002 ONT PromethION dataset. These results demonstrate strong throughput and efficient scaling on the MI250X architecture currently deployed in large HPC environments. Slorado was also evaluated on an eight-way configuration of AMD Instinct MI300X, a more recent accelerator generation designed to support higher memory capacity and AI-centric workloads. On MI300X, the same dataset completed in 0.8 hours for FAST, 2.3 hours for HAC, and 3.7 hours for SUP, illustrating the significant performance uplift available on this generation and the efficiency gains Slorado achieves when paired with higher-bandwidth, higher-capacity GPU architectures.

This work was led by Dr. Hasindu Gamaarachchi, together with his PhD student, Bonson Wong from UNSW Sydney.

Basecalling on AMD GPUs – Quick Start Guide

The easiest way to get started is by using the precompiled Slorado binaries for Linux.

Step 1: Download the latest x86_64-rocm-linux binaries

		VERSION=v0.3.0-beta
wget "https://github.com/BonsonW/slorado/releases/download/$VERSION/slorado-$VERSION-x86_64-rocm-linux-binaries.tar.xz"
	

Step 2: Extract the tarball and verify installation

		tar xvf slorado-$VERSION-x86_64-rocm-linux-binaries.tar.xz
cd slorado-$VERSION
bin/slorado --help
	

Step 3: Download the test dataset (20,000 reads)

		wget -O PGXXXX230339_reads_20k.blow5 https://slow5.bioinf.science/hg2_prom_5khz_subsubsample
	

Step 4: Run the basecaller with a model of your choice

Example using the v5.0.0 High Accuracy model:

		./bin/slorado basecaller models/dna_r10.4.1_e8.2_400bps_hac@v5.0.0 PGXXXX230339_reads_20k.blow5 -o out.fastq
	

If the binaries are incompatible with your system, Slorado can be compiled from source following the instructions at:

https://bonsonw.github.io/slorado/docs/rocm-build.html

Open Science. Open Compute. Open Future for Research.

Slorado demonstrates how open science and open computing can work together to advance discovery. By integrating cutting-edge AI-based basecalling methods with the open ROCm software stack, AMD and its collaborators are empowering researchers to push the boundaries of genomic analysis without being constrained by proprietary platforms.

Through initiatives like Slorado, AMD continues to enable a new generation of scientific computing that is scalable, transparent, and accessible to all.

Disclaimer

Slorado incorporates components licensed under the Oxford Nanopore Technologies PLC Public License v1.0. Use is permitted for research purposes only.

Share:

Article By


Senior Member of Technical Staff

Contributors


Senior Lecturer, UNSW

PhD student at UNSW

Life Sciences Supercomputing Specialist, Pawsey

Fellow, Research and Advanced Development

PMTS, Research and Advanced Development

Related Blogs