Simplifying OpenShift Case Information Gathering Workflow: Must-Gather Operator

January 6, 2020 | 2-minute read | Containers

Senior Principal Architect

Introduction

Collecting debugging information from a large set of nodes (for example, when creating SOS reports) can be a time-consuming task to perform manually. Additionally, in the context of Red Hat OpenShift 4.x and Kubernetes, it is considered bad practice to SSH into a node and perform debugging actions. To better accomplish this type of operation, OpenShift Container Platform 4 provides a new command, oc adm must-gather, which collects debugging information across the entire cluster (nodes and control plane). More detailed information on the must-gather command can be found in the platform documentation.

While using the must-gather command is fairly straightforward, the full end-to-end process can be time-consuming. It involves issuing the command, waiting for the associated tasks to complete, and then uploading the resulting information to the Red Hat case management system.
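For comparison, the manual workflow described above might look like the following shell sketch. The oc adm must-gather command and its --dest-dir flag are part of the OpenShift CLI; the helper name, directory layout, and archive naming are illustrative assumptions, and the upload is left as a manual step in the Red Hat Customer Portal.

```shell
#!/usr/bin/env bash
# Sketch of the manual workflow: collect, compress, then upload by hand.
# Assumes `oc` is logged in as a cluster administrator.
# collect_and_compress is a hypothetical helper name, not part of any tool.
collect_and_compress() {
  local case_id="$1"
  local dest_dir="${2:-./must-gather-${case_id}}"

  # Collect debugging information from the whole cluster into dest_dir.
  # oc's own output goes to stderr so that stdout carries only the archive path.
  oc adm must-gather --dest-dir="${dest_dir}" >&2 || return 1

  # Compress the results so they can be attached to the support case.
  tar -czf "${dest_dir}.tar.gz" \
    -C "$(dirname "${dest_dir}")" "$(basename "${dest_dir}")" || return 1

  echo "${dest_dir}.tar.gz"
}

# Usage (requires a cluster; XXXXXXXX is a placeholder case ID):
#   archive="$(collect_and_compress XXXXXXXX)"
#   # then attach "${archive}" to the case in the Red Hat Customer Portal
```

Every step here waits on the previous one and needs an administrator at the keyboard, which is the part the operator is meant to take over.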

A way to further streamline the process is to automate these actions.

Must-Gather Operator

The must-gather operator streamlines running the must-gather command and uploading the results to the Red Hat case management system. The must-gather operator is intended to be used only by the cluster administrator as it requires elevated permissions on the cluster. A must-gather run can be started by creating a MustGather custom resource (CR) similar to the following:

apiVersion: redhatcop.redhat.io/v1alpha1
kind: MustGather
metadata:
spec:
  caseID: 'XXXXXXXX'
  caseManagementAccountSecretRef:
  serviceAccountRef:

Within the MustGather CR, three parameters can be defined:

caseID: the Red Hat Support case to which the resulting output will be attached.
caseManagementAccountSecretRef: the secret containing the credentials needed to log in and upload files to the Red Hat case management system.
serviceAccountRef: the service account with the cluster-admin role that is used to run the must-gather command. Running as cluster-admin is a must-gather requirement.
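The objects these parameters refer to could be created along the following lines. The oc subcommands are standard; the namespace, the resource names, and the username and password secret keys are assumptions for illustration and should be checked against the operator's documentation.

```shell
#!/usr/bin/env bash
# Sketch: create the objects a MustGather CR refers to.
# Assumptions (not from the article): the namespace and resource names,
# and that the secret stores the credentials under `username` and `password`.
create_mustgather_prereqs() {
  local ns="$1" secret="$2" sa="$3" user="$4" pass="$5"

  # Credentials for the Red Hat case management system.
  oc create secret generic "${secret}" -n "${ns}" \
    --from-literal=username="${user}" \
    --from-literal=password="${pass}" || return 1

  # Service account that runs must-gather.
  oc create serviceaccount "${sa}" -n "${ns}" || return 1

  # must-gather requires cluster-admin, so bind that role to the service account.
  oc adm policy add-cluster-role-to-user cluster-admin -z "${sa}" -n "${ns}"
}

# Usage (requires a cluster; all values are placeholders):
#   create_mustgather_prereqs must-gather-operator case-management-creds \
#     must-gather-admin "$RH_USER" "$RH_PASSWORD"
```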

When this CR is created, the operator creates a job that runs the must-gather operations and uploads the resulting information as a compressed file.
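Creating the CR and checking on the resulting job can be sketched as follows. The name fields under metadata and under the two references are assumptions (the example above leaves those values blank), as are the helper name and the namespace variable.

```shell
#!/usr/bin/env bash
# Sketch: render a MustGather manifest that can be piped to `oc apply`.
# Assumption: each reference is given as a `name:` field; the article's
# example leaves these values blank, so verify against the operator docs.
# render_mustgather is a hypothetical helper name.
render_mustgather() {
  local name="$1" case_id="$2" secret="$3" sa="$4"
  cat <<EOF
apiVersion: redhatcop.redhat.io/v1alpha1
kind: MustGather
metadata:
  name: ${name}
spec:
  caseID: '${case_id}'
  caseManagementAccountSecretRef:
    name: ${secret}
  serviceAccountRef:
    name: ${sa}
EOF
}

# Usage (requires a cluster with the operator installed in namespace $ns):
#   render_mustgather example XXXXXXXX case-management-creds must-gather-admin \
#     | oc apply -n "$ns" -f -
#   oc get jobs -n "$ns"   # the operator creates a job for the run
```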

The must-gather operator watches only the namespace in which it is deployed, which makes it easier for a cluster administrator to restrict access to that namespace. Restricting access is recommended because the namespace must contain a service account with cluster-admin privileges (a must-gather requirement, as noted above) and therefore needs to be properly protected.

Running Additional Must-Gather Images

The must-gather command supports running multiple must-gather compatible images to collect additional information. This option is typically used for OpenShift add-ons, such as KubeVirt and OpenShift Container Storage (OCS). The must-gather operator supports this functionality by allowing these images to be specified as in the following example:


apiVersion: redhatcop.redhat.io/v1alpha1
kind: MustGather
metadata:
spec:
  caseID: 'XXXXXXX'
  caseManagementAccountSecretRef:
  serviceAccountRef:
  mustGatherImages:
  - quay.io/kubevirt/must-gather:latest
  - quay.io/ocs-dev/ocs-must-gather

As you can see, the mustGatherImages property is an array of strings, each naming an image. When it is added to a MustGather CR, all of the specified images are run in addition to the default must-gather image.
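For reference, a comparable manual run passes each image to oc adm must-gather with a repeated --image flag. The sketch below only assembles that command line and does not run it; the helper name is hypothetical. One difference worth keeping in mind: on the command line, passing --image replaces the default image rather than adding to it, so a manual run that also wants the default data has to request it explicitly.

```shell
#!/usr/bin/env bash
# Sketch: build an `oc adm must-gather` command line for several images.
# must_gather_cmd is a hypothetical helper; it prints the command only.
must_gather_cmd() {
  local cmd="oc adm must-gather"
  local image
  for image in "$@"; do
    # --image may be repeated, once per must-gather compatible image.
    cmd="${cmd} --image=${image}"
  done
  echo "${cmd}"
}

must_gather_cmd quay.io/kubevirt/must-gather:latest quay.io/ocs-dev/ocs-must-gather
# prints: oc adm must-gather --image=quay.io/kubevirt/must-gather:latest --image=quay.io/ocs-dev/ocs-must-gather
```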

Installation

The must-gather operator can be installed via OperatorHub or with a Helm chart.

The project GitHub repository contains detailed information on how to install the must-gather operator.

Conclusions

Being able to provide diagnostic information in a consistent fashion makes it easier for Red Hat support to help resolve issues. A more streamlined and automated collection process makes it more likely that the customer can provide timely debugging information to Red Hat support. The must-gather operator aims to help in this space.

About the author

Raffaele Spazzoli

Senior Principal Architect

Raffaele is a full-stack enterprise architect with 20+ years of experience. Raffaele started his career in Italy as a Java Architect, then gradually moved to Integration Architect and then Enterprise Architect. Later he moved to the United States and eventually became an OpenShift Architect for Red Hat consulting services, acquiring in the process knowledge of the infrastructure side of IT.

Currently Raffaele holds a consulting position as a cross-portfolio application architect with a focus on OpenShift. For most of his career, Raffaele has worked with large financial institutions, which has given him an understanding of the enterprise processes and the security and compliance requirements of large enterprise customers.

Raffaele has become part of the CNCF TAG Storage and contributed to the Cloud Native Disaster Recovery whitepaper.

Recently Raffaele has been focusing on how to improve the developer experience by implementing internal development platforms (IDP).
