Red Hat OpenShift AI is an MLOps and GenAIOps platform built with open source technologies, providing trusted and operationally consistent capabilities for teams to experiment, serve models, and build innovative applications. OpenShift AI accelerates the delivery of AI-enabled applications, helping organizations move from early pilots into operationally reliable deployments with greater speed and control.
The platform offers an integrated user interface (UI) experience with tooling for building, training, tuning, deploying, and managing predictive and gen AI models. Organizations can deploy models to hybrid cloud environments with a specific emphasis on providing a controlled and protected environment for sovereign and private AI. This approach ensures that sensitive data and AI models remain within designated geographic or organizational boundaries, meeting strict regulatory and compliance requirements.
Simplify AI adoption
As an add-on to Red Hat OpenShift, OpenShift AI provides a platform designed to increase AI adoption and trust by combining open source communities with a reliable AI ecosystem. This offers increased flexibility and freedom to select the right AI/ML technology for each organization. Users can build their predictive models or start with an external gen AI model, then enhance it with retrieval-augmented generation (RAG) using 1 of several model servers provided in the platform. The platform offers access to optimized and validated third-party models—such as Llama, Mistral, DeepSeek, Qwen, Kimi, and Granite—that run efficiently on vLLM (available on the Red Hat AI repository on Hugging Face). The catalog lets users explore these models and add their own. The OpenShift AI dashboard provides a central place to discover and access all applications and documentation, which simplifies adoption.
Ensure operational consistency across teams
OpenShift AI provides a consistent user experience that empowers data scientists, AI engineers, developers, and DevOps teams to collaborate effectively to deliver timely AI solutions. It offers self-service access to collaborative workflows, graphics processing unit (GPU) acceleration, and streamlined operations, providing a consistent delivery of AI solutions at scale across hybrid cloud environments and at the network edge.
IT operations benefit from simplified configurations and more automated workflows on a proven platform that can scale up or down with low effort, while providing reliable governance and security controls.
Gain hybrid cloud flexibility
With OpenShift AI, organizations can train, deploy, and manage AI/ML workloads across various clouds, on-premise datacenters, or air-gapped environments to meet regulatory, security, and data requirements. The platform is compatible with multiple AI accelerators from vendors like NVIDIA, AMD, Intel, IBM, Google, and Amazon Web Services (AWS). This capability can expand to a GPU-as-a-Service environment, so organizations can centrally manage, partition, and schedule GPU resources, while also providing detailed observability into their use.
Gen AI and agentic AI
For gen AI projects, organizations can get dedicated user experiences through components like AI hub, which provides a dashboard experience for platform engineers. AI hub consolidates catalog, registry, and model deployments, allowing teams to set up and deploy models, as well as discover and deploy verified Model Context Protocol (MCP) servers (developer preview). The experience also features Open Container Initiative (OCI)-compliant storage, artifact signing (tech preview), Hugging Face integration, and embedded performance insights to help govern how agents interact with internal systems.
Gen AI studio provides a hands-on environment to discover and interact with models, experiment in a playground environment, tune hyperparameters, and quickly prototype gen AI applications. It speeds development by adding side-by-side chat comparisons (tech preview), centralized prompt management and versioning, vector store integrations (tech preview), and an embedded MLflow user UI for end-to-end agentic traceability (tech preview).
OpenShift AI accelerates agentic AI by providing a unified application programming interface (API) layer and a flexible, scalable foundation. The platform's MCP support acts as a standardized integration layer to govern how agents interact with external tools. OpenShift AI operationalizes agentic workflows with AgentOps capabilities, including embedded MLflow for end-to-end agent traceability and tool-use logging, EvalHub for scientifically benchmarking and scoring AI agents (tech preview), and automated adversarial vulnerability scanning to proactively catch prompt injections and ensure agent safety before deployment (tech preview).
Additional tools include EvalHub (tech preview), which advances large language model (LLM) evaluation and benchmarking to support evaluation-driven development. EvalHub provides a unified interface to scientifically validate the performance, safety, and adversarial resilience of models, RAG pipelines, and agentic workflows prior to real-world deployment. To assist with efficient inference, tools like LLM Compressor and speculative decoding provide algorithms to reduce the size of custom models and increase response speeds by 2–3 times without quality loss, using methods similar to the ones Red Hat uses to create validated and optimized models.