AI010
Red Hat AI Inference Server Technical Overview
Optimize your AI workloads and reduce costs with Red Hat AI Inference Server.
Course Description
- Gain essential insights into AI deployment with this Red Hat AI Inference Server technical overview. Learn how to address the complexities and costs of running AI models in production. Discover how Red Hat's solution, powered by vLLM, optimizes performance and delivers significant cost savings across cloud, on-premise, virtualized, and edge environments. Dive into advanced techniques like quantization and speculative decoding to enhance your AI inference capabilities. This on-demand video content demonstrates seamless model deployment and management within OpenShift AI, showcasing how you can achieve unparalleled efficiency and flexibility for your AI workloads.
Course Content Summary
- What is Inference?
- Challenges with Inference
- Red Hat AI Inference Server Solution
- Red Hat AI Portfolio Integration
- Flexibility of Deployment
- LLM Compression Tool (Quantization)
- Performance Optimization Techniques (kV Cache, Speculative Decoding, Tensor Parallel Inference)
- Case Studies
- Model Deployment and Management
- Storage Connections for Models
- Metrics and Monitoring
- Hugging Face Integration
Audience for this course
- AI/ML Engineers and Practitioners
- DevOps Engineers
- Cloud Architects and Engineers
- Technical Decision-Makers
Recommended training
- There are no prerequisites for this Technical Overview.
Course Outline
- What is Inference?
- Challenges with Inference
- Red Hat AI Inference Server Solution
- Red Hat AI Portfolio Integration
- Flexibility of Deployment
- LLM Compression Tool (Quantization)
- Performance Optimization Techniques (kV Cache, Speculative Decoding, Tensor Parallel Inference)
- Case Studies
- Model Deployment and Management
- Storage Connections for Models
- Metrics and Monitoring
- Hugging Face Integration
Recommended next course or exam
More ways to master your skills
Get the best of both worlds: expert-led virtual training and self-paced learning, plus expert help and a certification exam. It’s all included in the Red Hat Learning Subscription.
On-site training available
If you would like to get your entire team trained, we can do it on your premises, in-person or remote.
Red Hat Learning Subscription
Comprehensive training and learning pathways on Red Hat products, industry-recognized certifications, and a flexible and dynamic IT learning experience.