Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: ACM 2.14.0
Component/s: Observability
Labels:
None

Regression:
None

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

PX Impact Score:

Description of problem:

When deploying a cluster with policies that are expected to configure openshift-monitoring to run in infra nodes and observability is already enabled in RHACM, the policy can cause endless rollbacks of the openshift-monitoring configuration.

Version-Release number of selected component (if applicable):

2.14.0
OCP 4.18 but different versions should also have the same issue

How reproducible:

customer environment

Steps to Reproduce:

set policies to be used on deployment of OCP with RHACM
deploy OCP
...

Actual results:

```
2025-10-21T18:30:40.133335362Z 2025-10-21T18:30:40.133Z INFO controllers.ObservabilityAddon.cmoWatcher Detected excessive reconciliations triggered by CMO configurations, potentially resulting from reconciliation conflicts between operators. Degrading the addon status.

{"request": "openshift-monitoring/cluster-monitoring-config"}

```

Expected results:

no clash due to observability being enabled on the configuration in place

Additional info:

The contents of the configuration policy enforce this

                    enableUserWorkload: true
                    alertmanagerMain:
                      volumeClaimTemplate:
                        spec:
                          storageClassName: samplestorageclass
                          volumeMode: Filesystem
                          resources:
                            requests:
                              storage: 5Gi
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    prometheusK8s:
                      volumeClaimTemplate:
                        spec:
                          storageClassName: samplestorageclass
                          volumeMode: Filesystem
                          resources:
                            requests:
                              storage: 300Gi
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    prometheusOperator:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    k8sPrometheusAdapter:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    kubeStateMetrics:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    telemeterClient:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    openshiftStateMetrics:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""
                    thanosQuerier:
                      nodeSelector:
                        node-role.kubernetes.io/infra: ""

this is enough to cause the issue ; there is another policy in use but only this affects the configmap. prior versions of RHACM are also likely to behave the same way.

duplicates

ACM-14052 Move CMO configuration to CRD

Assignee:: Unassigned

Reporter:: Felix Dewaleyne

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2025/11/13 10:57 AM

Updated:: 2025/11/13 3:26 PM

Resolved:: 2025/11/13 3:26 PM

Details

Description

Description of problem:

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:

Actual results:

Expected results:

Additional info:

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates