Evolving Comms Libraries in ROCm for Future AI Workloads

Abstract

This talk covers advances in AMD ROCm communication libraries for large-scale AI. RCCL innovations include copy-engine offloading, symmetric memory, GPU-initiated collectives, congestion-aware load balancing, and port failover. rocSHMEM extends an OpenSHMEM-like model with Python and Triton support, while NIXL contributions enable efficient inference. Together, these libraries support diverse NICs such as AMD Pensando AINICs and Broadcom Thor.

July 22, 2026 16:30 - 16:50

Speakers


Presented By