-
Feature
-
Resolution: Unresolved
-
Normal
-
None
-
ACM 2.8.Z, ACM 2.13.0, MCE 2.10.0
-
False
-
-
False
-
Not Selected
Hello Team,
The customer wants to have a Metric to retrieve Klusterlet status alert in case of RHACM hub cluster down/disaster.
He is testing a RHACM failover scenario :
We have 2 RHOCP clusters 4.12.42 and 4.14.42 version, already on-boarded on the active/hub RHACM cluster(Advanced Cluster Management for Kubernetes version 2.8.2).
At some point the active RHACM suffers a disaster(we simulated this by shutting down the RHACM cluster).
This can be seen from the managed clusters by checking the Klusterlet status, which is in this situation "HubConnectionDegraded", with Reason "BootstrapSecretError,HubKubeConfigError" and in the Message is clearly displayed that the API call to the RHACM is responding with EOF.
We have not identified any metric/servicemonitor on the managed clusters after the on-boarding process to help with checking this.
So, the customer wants to have a Metric to retrieve Klusterlet status alerts in case the primary ACM hub cluster is down.