Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-62668

Removal of vSphere CSI driver removes telemetry rules

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • 4.20.0
    • 4.17, 4.18, 4.19, 4.20, 4.21
    • Storage / Operators
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • In Progress
    • Bug Fix
    • Hide
      Cause: When an administrator set the `managementState` of the vSphere CSI driver operator to `Removed`, the cleanup process would delete an associated `PrometheusRules` custom resource. This resource incorrectly contained both vSphere-specific alerts and telemetry recording rules.
      Consequence: Important telemetry metrics related to vSphere storage were lost, impacting cluster health monitoring and reporting.
      Fix: The telemetry recording rules have been moved into a separate `PrometheusRules` object that is no longer tied to the lifecycle of the vSphere CSI driver operator.
      Result: Now, removing the vSphere CSI driver operator correctly deletes only the vSphere-specific alerts, while the essential telemetry metrics are preserved.
      Show
      Cause: When an administrator set the `managementState` of the vSphere CSI driver operator to `Removed`, the cleanup process would delete an associated `PrometheusRules` custom resource. This resource incorrectly contained both vSphere-specific alerts and telemetry recording rules. Consequence: Important telemetry metrics related to vSphere storage were lost, impacting cluster health monitoring and reporting. Fix: The telemetry recording rules have been moved into a separate `PrometheusRules` object that is no longer tied to the lifecycle of the vSphere CSI driver operator. Result: Now, removing the vSphere CSI driver operator correctly deletes only the vSphere-specific alerts, while the essential telemetry metrics are preserved.
    • None
    • None
    • None
    • None

      When the vSphere CSI driver operator is removed (using managementState: Removed), OCP removes vSphere problem detector PrometheusRules. This object contains alert definitions which should be removed (VPD should not alert if the driver is removed), but it also contains metrics that are transported via telemetry and we do want to keep these.

              rhn-support-dfajmon Daniel Fajmon
              rhn-support-dfajmon Daniel Fajmon
              None
              None
              Wei Duan Wei Duan
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: