-
Bug
-
Resolution: Done
-
Major
-
ACM 2.10.0, ACM 2.10.1
-
1
-
False
-
None
-
False
-
-
-
RHOBS Sprint 20
-
Yes
Description of problem:
Endpoint SA is a part of the resources to set up endpoint and metrics collector in the hub/local-cluster. Post ACM 8509, the resources are not managed under manifestworks unlike regular spokes. So any deletion/update of these resources needs to be explicitly handled and watched by the placement controller for the hub
Version-Release number of selected component (if applicable):
How reproducible:
- Delete endpoint SA in hub
- Workaround - delete MCO pod to recreate SA
Steps to Reproduce:
- ...
Actual results:
Expected results:
- New endpoint SA is created when deleted
Additional info:
Logs in metric collector when SA is deleted -
│ level=error caller=logger.go:60 ts=2024-04-16T10:00:36.262699118Z component=collectrule/evaluator msg="failed to evaluate collect rule" err="prometheus server forbidden: https://prometheus-k8s.openshift-monitor │
│ ing.svc:9091/api/v1/query?query=%281+-avg%28rate%28node_cpu_seconds_total%7Bmode%3D%22idle%22%7D%5B5m%5D%29%29%29%2A+100+%3E+70" rule="(1 - avg(rate(node_cpu_seconds_total{mode=\"idle\"}[5m]))) * 100 > 70" │
│ level=error caller=logger.go:60 ts=2024-04-16T10:00:36.277740782Z component=collectrule/evaluator msg="failed to evaluate collect rule" err="prometheus server forbidden: https://prometheus-k8s.openshift-monitor │
│ ing.svc:9091/api/v1/query?query=%281+-sum%28%3Anode_memory_MemAvailable_bytes%3Asum%29%2F+sum%28kube_node_status_allocatable%7Bresource%3D%22memory%22%7D%29%29+%2A+100+%3E+70" rule="(1 - sum(:node_memory_MemA │
│ vailable_bytes:sum) / sum(kube_node_status_allocatable{resource=\"memory\"})) * 100 > 70"
- links to
-
RHBA-2024:130356 Red Hat Advanced Cluster Management 2.10.2 bug fixes and container updates