ElMerFold: Exascale AI for Protein Structure Prediction with El Capitan

Name: ElMerFold: Exascale AI for Protein Structure Prediction with El Capitan
Start: 2026-07-22T15:50:00-07:00
End: 2026-07-22T16:10:00-07:00

DProtein structure prediction is foundational to modern biology, enabling breakthroughs in drug discovery, enzyme engineering, and AI-driven science. We present ElMerFold, a production-scale synthetic data generation workflow running on the El Capitan system at 11,000 nodes and 44,000 APUs. ElMerFold processes ~41 million proteins at 2,378 structures/s, achieving a 16.3× improvement over prior approaches and reaching 969 PFLOP/s FP32 inference performance.

July 22, 2026 3:50 PM - 4:10 PM PDT

AMD Fellow, HPC and Sovereign AI | AMD

Senior Computer Scientist | Lawrence Livermore National Laboratory

Topic

Enterprise AI

AI Training & Inference

Session Type

Tech Talk

Supercomputing for All: Bringing Exascale Innovation to Enterprise AI

HPE and AMD have built some of the world's most advanced supercomputers, including Frontier and El Capitan. Join HPE leaders who helped deploy these systems and learn how the technologies, expertise, and operational practices developed for exascale computing are enabling enterprise AI. Discover how organizations can accelerate AI adoption, scale workloads, and reduce deployment risk.;HPE and AMD have built some of the world's most advanced supercomputers, including Frontier and El Capitan. Join HPE leaders who helped deploy these systems and learn how the technologies, expertise, and operational practices developed for exascale computing are enabling enterprise AI. Discover how organizations can accelerate AI adoption, scale workloads, and reduce deployment risk.

July 23, 2026
Domain-Specific AI at Scale: Open Models, Post-Training, and AI Infrastructure

Learn how domain-specific AI moves beyond generic models using post-training, domain evals, and scalable open infrastructure. Using Open Telco Models as a case study, this session covers curated data, reward loops, unified training and serving, and AMD Instinct/ROCm-based stacks for building specialized AI systems at enterprise scale.;Learn how domain-specific AI moves beyond generic models using post-training, domain evals, and scalable open infrastructure. Using Open Telco Models as a case study, this session covers curated data, reward loops, unified training and serving, and AMD Instinct/ROCm-based stacks for building specialized AI systems at enterprise scale.

July 23, 2026
Unlocking Secure Enterprise Intelligence at Scale with Cisco

As organizations transition from AI experimentation to production-scale infrastructure, demand for high-performance compute must be matched by security and reliability. This session explores Cisco's vision for secure, high-performance AI environments and a framework for accelerating AI deployment while mitigating risks associated with large-scale data processing. Learn how the Cisco UCS C845A M8 and AMD are enabling the next generation of enterprise AI.;As organizations transition from AI experimentation to production-scale infrastructure, demand for high-performance compute must be matched by security and reliability. This session explores Cisco's vision for secure, high-performance AI environments and a framework for accelerating AI deployment while mitigating risks associated with large-scale data processing. Learn how the Cisco UCS C845A M8 and AMD are enabling the next generation of enterprise AI.

July 23, 2026
From Models to Production—A Blueprint for AI at Scale

Moving AI from training to production takes more than GPUs. Hear how Microsoft and Chai AI built scalable AI infrastructure on Vultr using AMD Instinct GPUs and ROCm. Learn best practices for data locality, secure networking, Kubernetes orchestration, benchmarking, cost optimization, and scale-out operations. Leave with a practical blueprint for deploying fast, portable, production-ready AI workloads.;Moving AI from training to production takes more than GPUs. Hear how Microsoft and Chai AI built scalable AI infrastructure on Vultr using AMD Instinct GPUs and ROCm. Learn best practices for data locality, secure networking, Kubernetes orchestration, benchmarking, cost optimization, and scale-out operations. Leave with a practical blueprint for deploying fast, portable, production-ready AI workloads.

July 23, 2026
Zyphra: Large-Model Training Lessons on AMD

Learn what it took to train ZAYA1-74B, a 74B-parameter mixture-of-experts model, end-to-end on AMD Instinct MI300X. This session shares key engineering lessons from designing an efficient training stack, optimizing long-context performance, and building a reinforcement learning pipeline for math, code, and agentic AI workloads. Discover practical insights for training and deploying large AI models on AMD infrastructure.;Learn what it took to train ZAYA1-74B, a 74B-parameter mixture-of-experts model, end-to-end on AMD Instinct MI300X. This session shares key engineering lessons from designing an efficient training stack, optimizing long-context performance, and building a reinforcement learning pipeline for math, code, and agentic AI workloads. Discover practical insights for training and deploying large AI models on AMD infrastructure.

July 23, 2026
Right Size Your Memory Footprint to Move IT Refresh Forward

Memory has rarely been in such short supply and is impeding customer data center refresh plans. In this interactive conversation, we’ll discuss tips and tools for right-sizing memory configurations to help move your data center efficiency initiatives forward and preserve ROI. Bring your questions and our experts will provide answers!;Memory has rarely been in such short supply and is impeding customer data center refresh plans. In this interactive conversation, we’ll discuss tips and tools for right-sizing memory configurations to help move your data center efficiency initiatives forward and preserve ROI. Bring your questions and our experts will provide answers!

July 23, 2026
Training at Scale with AMD Primus

Primus makes large-scale training on Instinct reliable, debuggable and highly performant. It supports the latest OSS training frameworks, models, and is expanding support to new, cutting-edge model architectures, training techniques, and datatypes. SOTA pre and post training performance with Primus, proven at scales of thousands of GPUs, positions an AMD Instinct GPU as a competitive solution for model development at frontier labs, enterprises, and AI startups.;Primus makes large-scale training on Instinct reliable, debuggable and highly performant. It supports the latest OSS training frameworks, models, and is expanding support to new, cutting-edge model architectures, training techniques, and datatypes. SOTA pre and post training performance with Primus, proven at scales of thousands of GPUs, positions an AMD Instinct GPU as a competitive solution for model development at frontier labs, enterprises, and AI startups.

July 23, 2026
Build an MRI Analysis Agent with AMD Blueprints

Build and deploy an AI-powered MRI analysis agent in minutes using the AMD mri-doc Solution Blueprint. Run a Gradio-based pipeline that accepts DICOM, NIfTI, and standard image formats, applies tissue segmentation and anomaly detection, and generates LLM-drafted clinical reports on AMD Instinct GPUs. Then customize: swap the LLM AIM, reuse an existing model endpoint, or extend the pipeline for your specific clinical workflow.;Build and deploy an AI-powered MRI analysis agent in minutes using the AMD mri-doc Solution Blueprint. Run a Gradio-based pipeline that accepts DICOM, NIfTI, and standard image formats, applies tissue segmentation and anomaly detection, and generates LLM-drafted clinical reports on AMD Instinct GPUs. Then customize: swap the LLM AIM, reuse an existing model endpoint, or extend the pipeline for your specific clinical workflow.

July 23, 2026

ElMerFold: Exascale AI for Protein Structure Prediction with El Capitan

Abstract

Speakers

Presented By

Related Sessions

AMD.com Feedback