Understanding AI

Artificial intelligence (AI) refers to computer science processes and statistical algorithms that simulate and augment human intelligence. 

Red Hat AI Enterprise

November 5, 2025
An integrated AI platform for building and running efficient, cost-effective models, agents, and applications on any infrastructure.

Foundations of AI

Article

What is machine learning?

April 15, 2026
Machine learning is the technique of training a computer to find patterns, make predictions, and learn from experience without being explicitly programmed.
Artificial intelligence
Article

What is deep learning?

April 15, 2026
Deep learning is an artificial intelligence (AI) technique that teaches computers to process data using an algorithm inspired by the human brain.
Artificial intelligence
Article

What are foundation models for AI?

April 15, 2026
A foundation model is a type of machine learning (ML) model that is pre trained to perform a range of tasks.
Artificial intelligence
Article

What are large language models?

February 17, 2026
A large language model (LLM) is a type of artificial intelligence that uses machine learning techniques to understand and generate human language.
Artificial intelligence
Article

SLMs vs LLMs: What are small language models?

February 10, 2026
A small language model (SLM) is a smaller version of a large language model (LLM) that has more specialized knowledge, is faster to customize, and more efficient to run.
Artificial intelligence
Article

What is AI inference?

February 18, 2026
AI inference is when an AI model provides an answer based on data. It's the final step in a complex process of machine learning technology.
AI inference, Artificial intelligence
Article

AI infrastructure explained

April 15, 2026
AI infrastructure combines artificial intelligence and machine learning (AI/ML) technology to develop and deploy reliable and scalable data solutions.
Artificial intelligence
Article

What is an AI platform?

April 14, 2026
An AI platform is an integrated collection of technologies to develop, train, and run machine learning models.
Artificial intelligence

Types of AI

Article

What is generative AI?

February 25, 2026
Generative AI is a kind of artificial intelligence technology that relies on deep learning models trained on large data sets to create new content.
Artificial intelligence
Article

Predictive AI vs. generative AI

February 10, 2026
Both gen AI and predictive AI have significant differences and use cases. As AI evolves, distinguishing between these different types helps clarify their distinct capabilities.
Artificial intelligence
Article

What is agentic AI?

April 15, 2026
Agentic AI is a software system designed to interact with data and tools in a way that requires minimal human intervention.
Artificial intelligence
Article

Agentic AI vs. generative AI

April 15, 2026
Agentic AI and generative AI explained: Learn how each works, their unique strengths, and how they can collaborate for smarter solutions.
Artificial intelligence

Model enhancements

Article

What is retrieval-augmented generation?

April 15, 2026
Retrieval-augmented generation (RAG) links external resources to an LLM to enhance a generative AI model’s output accuracy.
Artificial intelligence
Article

RAG vs. fine-tuning

February 17, 2026
RAG and fine-tuning both aim to improve LLMs, but use different methods. RAG avoids altering the model, while fine-tuning requires adjusting its parameters.
Artificial intelligence
Article

What is parameter-efficient fine-tuning (PEFT)?

April 16, 2026
PEFT is a set of techniques that adjusts only a portion of parameters within an LLM to save resources.
Artificial intelligence
Article

LoRA vs. QLoRA

February 12, 2025
LoRA (Low-Rank adaptation) and QLoRA (quantized Low-Rank adaptation) are both techniques for training AI models.
Artificial intelligence
Article

What is InstructLab?

October 6, 2025
InstructLab simplifies the process of customizing large language models with private data.
Artificial intelligence, Open source
Article

What is vLLM?

April 6, 2026
vLLM is a collection of open source code that helps language models perform calculations more efficiently.
Artificial intelligence
Article

What is Model Context Protocol (MCP)?

April 15, 2026
Learn how Model Context Protocol (MCP) connects AI applications to external data sources, helping you build smarter workflows.
Artificial intelligence
Article

What is Model-as-a-Service?

April 14, 2026
Model-as-a-Service (MaaS) is an approach to delivering AI models as shared resources, allowing users within an organization to access them on demand.
Artificial intelligence
Article

What are Granite models?

February 11, 2026
IBM's Granite models are a series of LLMs created for enterprise applications. Granite models can support gen AI use cases that use language and code.
Artificial intelligence

AI at scale

Article

What is sovereign AI?

April 15, 2026
Sovereign AI is about owning AI technology, keeping data local, and ensuring your systems reflect your unique values and legal requirements.
Artificial intelligence
Article

What is llm-d?

February 18, 2026
llm-d is a Kubernetes-native, open source framework that speeds up distributed LLM inference at scale.
Artificial intelligence
Article

What is distributed inference?

April 15, 2026
Distributed inference lets AI models process workloads more efficiently by dividing the labor of inference across a group of interconnected devices.
Artificial intelligence
Article

What is enterprise AI?

April 14, 2026
Enterprise AI is the integration of artificial intelligence (AI) tools and machine learning software into large scale operations and processes. Now, businesses can solve problems in weeks rather than years.
Artificial intelligence
Article

What is edge AI?

February 17, 2026
Edge artificial intelligence (AI) is the implementation of artificial intelligence in an edge computing environment.
Artificial intelligence, Edge computing
Article

What is MLOps?

April 15, 2026
Machine learning operations (MLOps) is a set of workflow practices aiming to streamline the process of deploying and maintaining machine learning (ML) models.
Artificial intelligence
Article

What is LLMOps?

April 15, 2026
Large Language Model Operations (LLMOps) Large Language Model Operations (LLMOps) are operational methods used to manage large language models.
Artificial intelligence
Article

AIOps explained

April 16, 2026
AIOps (AI for IT operations) is an approach to automating IT operations with machine learning and other advanced AI techniques.
Artificial intelligence
Article

What is AI security?

April 14, 2026
AI security defends AI applications against malicious attacks that aim to weaken AI workloads, manipulate data, or steal sensitive information.
Artificial intelligence, Security
Article

Understanding AI/ML use cases

April 15, 2026
It is important to understand the benefits and prepare for the challenges of AI/ML that are specific to your business processes and workloads.
Artificial intelligence
Article

What is AI in healthcare?

April 15, 2026
Discover the benefits and challenges of AI in healthcare and how Red Hat is helping the industry.
Artificial intelligence
Article

AI in banking

February 10, 2026
The next era of artificial intelligence (AI) and machine learning (ML) in banking will no doubt see improvements in customer service and operational efficiency.
Artificial intelligence
Article

Understanding AI in telecommunications

April 15, 2026
Learn how the right IT solutions can help your telco use AI efficiently and cost-effectively to overcome common challenges.
Artificial intelligence

Red Hat AI use cases

Generative AI

Generative AI

Produce new content, like text and software code. 

Red Hat AI lets you run the generative AI models of your choice, faster, with fewer resources, and lower inference costs. 

Predictive AI

Predictive AI

Connect patterns and forecast future outcomes. 

With Red Hat AI, organizations can build, train, serve and monitor predictive models, all while maintaining consistency across the hybrid cloud.

Operationalized AI

Operationalized AI

Create systems that support the maintenance and deployment of AI at scale. 

With Red Hat AI, manage and monitor the lifecycle of AI-enabled applications while saving on resources and ensuring compliance with privacy regulations. 

Agentic AI

Agentic AI

Build workflows that perform complex tasks with limited supervision. 

Red Hat AI provides a flexible approach and stable foundation for building, managing and deploying agentic AI workflows within existing applications.

Red Hat AI portfolio

Scale your AI foundation

  • Customize models with control.
  • Optimize resource allocation.

Optimize model performance 

  • Fast inference at scale.
  • Powered by vLLM.

Build and deploy AI applications

  • Manage the full AI lifecycle.
  • Implement AI guardrails. 

Run LLMs on an individual server

  • Develop, test, and run gen AI.
  • Fast, flexible inference. 

AI customer stories from Red Hat Summit and AnsibleFest 2025

Turkish Airlines

Turkish Airlines doubled the speed of deployment times with organization-wide data access.

JCCM Logo

JCCM improved the region's environmental impact assessment (EIA) processes using AI.

DenizBank

Denizbank sped up time to market from days to minutes.

Hitachi logo

Hitachi operationalized AI across its entire business with Red Hat AI.