Transitional approach to implementing pragmatic Site Reliability Engineering (SRE) Technical Overview


Course description

Learn why SRE has emerged as an IT strategy to deliver improved performance and time to market.

Site reliability engineering (SRE) is a shared responsibility model that, when executed well, can improve efficiency, resiliency, and security. But implementing SRE in an organization requires a cultural shift, team shaping and training, and process changes. To achieve this shift, organizations need to create a roadmap for the transition. This technical overview covers a pragmatic approach to SRE and introduces the core tools for shared responsibility that enable an incremental transition to SRE.

Course summary

Through online, on-demand videos you will learn about:

  • What is SRE? A pragmatic approach to SRE
  • Tools for shared responsibility
  • How to scale service reliability
  • Creating a ‘safe to fail’ culture
  • Assessing inconsistencies and how they generate toil
  • Importance of aligning ITSM and DevOps/SRE
  • Team shaping - build/run teams
  • Defining and aligning goals of incident management and DevOps
  • Collaboration hacks - things you can use now to get started in transition
  • Institutionalizing metrics
  • Summary - a pragmatic approach

Audience for this course

IT decision makers and leaders who are considering, implementing, or improving SRE practices.

Prerequisites for this course

There are no prerequisites for this course.


Build your skills path

Take this course as part of a Red Hat Learning Subscription, which gives you on-demand, unlimited access to our online learning resources for an entire year.

Verify your knowledge

Take a free skills assessment to test your expertise, determine gaps, and get recommendations for where to start with Red Hat training.