OVERVIEW
“Scaling Generative AI with Confidence: LLM-d and OpenShift for Distributed Inference”
As large language models grow in capability, they also grow in complexity, requiring GPU memory and compute beyond what most single systems can provide. For infrastructure and operations teams, this creates new challenges around deployment, scheduling, cost management, and reliability.
In this session, we’ll introduce LLM-d, an open, Kubernetes-native framework for distributed inference. You’ll learn how Red Hat is leading efforts across the community to shape LLM-d into a scalable, operator-friendly platform for production GenAI.
We’ll demonstrate how LLM-d integrates into OpenShift AI, supports multi-GPU workloads, and provides:
- Declarative model deployment using Kubernetes-native APIs
- Distributed serving for large models like Llama 3 and Granite
Event details
Date: Thursday, 11 September 2025
Time: 10:30 AM IST | 1 PM SGT | 3 PM AEST
If you have any questions, please reach out to Elisa Navarro.
Bryon Baker
AI Specialist Solution Architect, APAC, Red Hat
Bryon Baker is a Specialist Solution Architect for Red Hat Asia Pacific. Having previously held managerial, engineering and architecture positions at National Australia Bank, Rational Software (USA), Ivar Jacobson International, and Siemens Research (UK), Bryon brings over 30 years of diverse experience in the IT industry.
Bryon’s experience spans embedded systems, electronics engineering, systems engineering, enterprise architecture, AI/ML and product management. Bryon is most passionate about machine learning and distributed system architectures.
Bryon holds a Bachelor of Science from Deakin University with majors in Mathematics and Computer Science.
Li Ming Tsai
Principal AI Architect, APAC, Red Hat
Li Ming is an AI Architect and member of the APAC AI Specialist team, with over 20 years of experience in technology and engineering. He leads AI innovation to solve complex business challenges, drive customer outcomes, and shape strategic AI initiatives across the region.
Previously, he served as Chief Architect for the Public Sector at Red Hat Singapore, focusing on Cloud, Containers, and Automation. A long-time advocate of Open Source, Li Ming has spent over a decade working in High Performance Computing (HPC) and Big Data, with a strong background in engineering, R&D, consulting, and technology leadership.