Skip to main content

Powering Open-Model Inference at Scale with TensorWave and Featherless AI

abstract background

Abstract

The growth of open-weight models has created a new infrastructure challenge: serving many distinct models at scale without the constraints of closed ecosystems. This session shows how TensorWave built an AMD-first cloud on AMD Instinct and ROCm to support open-model inference at production scale, and how Featherless AI is applying that infrastructure to serve hundreds of models with practical lessons on throughput, cost, and ecosystem readiness.

July 23, 2026 1:00 PM - 1:45 PM PDT

Speakers


Presented By