Building Next-Gen AI Infrastructure: Scaling Enterprise LLM Serving with RadixArk

Abstract

Deploying generative AI in production requires infrastructure that is robust, scalable, and highly optimized. Join Ying Sheng, founder of SGLang and RadixArk, to discover how to build a unified and efficient LLM serving stack. This session covers the transition from open-source innovations in SGLang, such as fast structured generation and advanced scheduling, to enterprise-ready deployments with RadixArk. We will demonstrate how our serving engines maximize compute utilization, minimize latency, and scale AI applications across diverse environments, building the foundations needed to bring frontier AI to everyone.

July 22, 2026 14:30 - 14:50


Presented By