Skip to main content

Accelerating LLM Inference on AMD ROCm with AITER and ATOM

abstract background

Abstract

This technical talk introduces AITER and ATOM, optimized inference technologies for AMD ROCm software. Learn how AITER accelerates LLM and MoE execution with optimized kernels and distributed inference enhancements, while ATOM integrates these capabilities into familiar vLLM and SGLang workflows through plugin-based acceleration. The session highlights how AMD enables scalable, high-performance open-source LLM serving while preserving existing developer and deployment workflows.

July 23, 2026 13:00 - 13:20

Speakers