Uploaded image for project: 'OpenShift Request For Enhancement'
  1. OpenShift Request For Enhancement
  2. RFE-7743

Must gather all images. Application of consistent Day-2 Operator Annotations.

XMLWordPrintable

    • Icon: Feature Request Feature Request
    • Resolution: Won't Do
    • Icon: Undefined Undefined
    • None
    • None
    • None
    • None
    • Product / Portfolio Work
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Problem Statement

      When evaluating OpenShift Container Platform cluster configuration the oc adm must-gather tool is indispensable for collecting diagnostic information. A critical feature, must-gather --all-images, is designed to list all container images required by the cluster's deployed components, which is vital for setting up disconnected environments, pre-pulling images, and validating image availability.

      However, many "Day-2" Operators, do not include the necessary Kubernetes annotations on their operand deployments or custom resources defined in the ClusterServiceVersion. This omission results in the must-gather --all-images command providing an incomplete list of required images, forcing manual and potentially error-prone discovery processes for critical components.

      This gap is particularly problematic in:

      • Complex deployments with numerous Day-2 Operators.
      • Scenarios where all logs, configuration, etc. required for troubleshooting needs to be gathered at the same time to enable rapid Root Cause Analysis (RCA) and to avoid support escalation.

      Proposed Solution (Enhancement)

      In order to be able to use the must-gather --all-images feature the Red Hat-supported Day-2 Operators reference in this document must be updated to consistently include the relevant and accurate Kubernetes annotations on their deployed operands and associated Custom Resources (CRs). This is to ensure that must-gather --all-images can correctly identify and list all operators for the purpose of checking the deployed state of the Operator's configuration.

      Extra Information
      This link explains how must-gather works with the –all-images parameter.  https://github.com/openshift/enhancements/blob/master/enhancements/oc/must-gather.md

      Operators include a special annotation in their CSV (ClusterServiceVersion), typically like:

      annotations:
      operators.openshift.io/collection-image: "quay.io/my-org/my-operator-must-gather:latest"

      Specific Day-2 Operators for annotation

      The Day-2 Operators are as follows.   

      ACM - Advanced Cluster Management

      CLO - Cluster Logging Operator

      Compliance Operator

      KMM - Kernel Module Management Operator

      LCA Operator

      LSO - Local Storage Operator

      LVMS Operator

      MCE - MultiCluster Engine

      OADP - OpenShift API for Data Protection

      ODF - OpenShift Data Foundation

      OpenShift GitOps Operator

      SRIOV Network Operator

      TALM - Topology Aware Lifecycle Manager

      Why does the customer need this? 

      Ensuring Day-2 Operators properly annotate their images for must-gather --all-images provides significant benefits, especially for our partners and customers:

      • Validation of Reference Architectures:
      • The ability to collect a complete accurate snapshot of the cluster's current configuration state through must-gather is a critical aspect of the blueprint design validation and deviation detection process.
      • Reduced Partner/Customer Burden:
      • Currently, Red Hat support teams, professional services and partners must manually inspect Operator manifests, pod definitions, and ClusterServiceVersion (CSV) resources to piece together a complete view of the Day-2 Operators deployed. This has the potential to be time-consuming and error-prone. Correct annotations automate this burden.
      • Enhanced Operational Consistency and Automation:
      • Enables automated workflows for cluster configuration validation and security scanning.
      • Provides a standardized way for partners and customers to gather must-gathers for the complete deployment without requiring knowledge of the individual must-gather images that should be used for each operator
      • Simplified and Automated Troubleshooting:
      • Comprehensive Data Collection: The all-images option relies on these annotations to build a complete list of images used by the cluster. Without them, must-gather misses critical images, making initial diagnostics incomplete and delaying troubleshooting.
      • Reduced Mean Time To Resolve (MTTR): Field engineers and Red Hat Support can obtain a full and accurate list of images in one command, drastically reducing the time spent manually identifying missing images or verifying image versions. This accelerates the problem identification and resolution process.

      Improved Support Experience: Customers receive faster, more accurate support when comprehensive data can be gathered efficiently, preventing multiple back-and-forths for image information.

              rh-ee-dsreeniv Deepak Sreenivas
              rh-ee-ajoyce Antony Joyce
              None
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved:
                None
                None