-
Task
-
Resolution: Done
-
Major
-
None
-
None
-
None
-
Quality / Stability / Reliability
-
False
-
-
False
-
-
Fleet Manager has logic for reconciling the add-on in the cluster reconciler.
If at least one request to OCM to get or update the add-on returns an error, then the alert is triggered. Sometimes it can be noisy because:
1) Sometimes ocm requests fail
2) Since we do not upgrade the addon frequently it has no meaningful impact.
Therefore we need to tune the alert rule so that it gets triggered if the error rate increases over a meaningful time interval.