Skip to main content

Accelerating vLLM Inference on AMD Instinct GPUs with AMD ATOM

abstract background

Abstract

This advanced hands-on workshop introduces AMD ATOM an opensource optimized LLM inference backend for ROCm. Learn to serve LLMs with popular workflows using AMD-optimized attention & inference kernels. The Workshop introduces out-of-tree plugins for existing vLLM & SGLang users & aims at demonstrating how ATOM preserves familiarity of the frameworks while accelerating model execution & boosting inference performance, bridging opensource frameworks with the AMD high-performance inference stack.

July 23, 2026 4:00 PM - 4:45 PM PDT

Speakers


Presented By