Maybe this is too obvious for others out there, but a book I would recommend for sysadmins is Site Reliability Engineering (SRE), edited by Betsy Beyer, Chris Jones, et al. It’s not an obscure choice by any means. This book might be one of the best known titles to sysadmins everywhere. I recommend it because it’s easy to ignore, but—I think— game-changing in its own right.

For years, I’d ignored this SRE book on the basis that anything Google-scale could not possibly apply to what I did on a day-to-day basis. I reasoned that the masses of online discussion could be chalked up to the fanboys and fangirls. Certainly, after a decade as a sysadmin, nothing truly new would be included in what was essentially a sysadmin handbook.

I was wrong. When I did finally pick up a copy and start to read it, my mind was changed within a few chapters. No, there’s no magical recipe for perfect system administration. Yes, it describes a job that focuses heavily on programming rather than "traditional" system administration. No, it is not a manual about how to be a system administrator.

Site Reliability Engineering describes exactly the challenges facing my team. We’re handling more servers per sysadmin than ever before—a ratio of hundreds-to-one where ten years ago it was dozens-to-one. Even with better automation tools and increased scripting, trying to handle that scale is challenging, and a new workflow has to be developed to deal with the load.

SREs are arguably not sysadmins as we know the term, but they are the next generation of operations staff. This book discusses well-thought-out steps to transition a team from traditional sysadmins to a team of SREs, including the skills needed, practices to put into place as a team, and policies from leadership that support and enhance these changes. It is well worth the read, even as a single contributing individual.


关于作者

Chris Collins is an SRE at Red Hat and a Community Moderator for Opensource.com. He is a container and container orchestration, DevOps, and automation evangelist, and will talk with anyone interested in those topics for far too long and with much enthusiasm.

UI_Icon-Red_Hat-Close-A-Black-RGB

按频道浏览

automation icon

自动化

有关技术、团队和环境 IT 自动化的最新信息

AI icon

人工智能

平台更新使客户可以在任何地方运行人工智能工作负载

open hybrid cloud icon

开放混合云

了解我们如何利用混合云构建更灵活的未来

security icon

安全防护

有关我们如何跨环境和技术减少风险的最新信息

edge icon

边缘计算

简化边缘运维的平台更新

Infrastructure icon

基础架构

全球领先企业 Linux 平台的最新动态

application development icon

应用领域

我们针对最严峻的应用挑战的解决方案

Virtualization icon

虚拟化

适用于您的本地或跨云工作负载的企业虚拟化的未来