Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-20173

Metrics on managed cluster to retrieve Klusterlet status alert in case of primary hub cluster down/disaster

XMLWordPrintable

    • Icon: Feature Feature
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • ACM 2.8.Z, ACM 2.13.0, MCE 2.10.0
    • Server Foundation
    • False
    • Hide

      None

      Show
      None
    • False
    • Not Selected

      Hello Team,

      The customer wants to have a Metric to retrieve Klusterlet status alert in case of RHACM hub cluster down/disaster.

      He is testing a RHACM failover scenario :

      We have 2 RHOCP clusters 4.12.42 and 4.14.42 version, already on-boarded on the active/hub RHACM cluster(Advanced Cluster Management for Kubernetes version 2.8.2).

      At some point the active RHACM suffers a disaster(we simulated this by shutting down the RHACM cluster).

      This can be seen from the managed clusters by checking the Klusterlet status, which is in this situation "HubConnectionDegraded", with Reason "BootstrapSecretError,HubKubeConfigError" and in the Message is clearly displayed that the API call to the RHACM is responding with EOF.

      We have not identified any metric/servicemonitor on the managed clusters after the on-boarding process to help with checking this.

      So, the customer wants to have a Metric to retrieve Klusterlet status alerts in case the primary ACM hub cluster is down.

              Unassigned Unassigned
              rhn-support-cchouhan Chandan Chouhan
              Hui Chen Hui Chen
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated: