Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Critical
Fix Version/s: None
Affects Version/s: 4.9.z
Component/s: Monitoring
Labels:
- telco-priority-1

Test Coverage:

-
Regression:
None
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Internal Whiteboard:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Description of problem:

While testing ACM provisioning many SNOs via ZTP, and then ZTP upgrading those SNOs from 4.9.46 to 4.10.28, almost all of the common-config policy per clusters became NonCompliant.  Upon examining why the policies became NonCompliant, it seems the cluster-monitoring-config was changed/replaced during the upgrade.

Version-Release number of selected component (if applicable):

4.9.46 -> 4.10.28

How reproducible:

Always

Steps to Reproduce:

1.
2.
3.

Actual results:

# oc --kubeconfig=/root/hv-vm/sno/manifests/sno00001/kubeconfig get cm -n openshift-monitoring cluster-monitoring-config  -o yaml
apiVersion: v1
data:
  config.yaml: |
    alertmanagerMain:
      nodeSelector: null
      resources: null
      tolerations: null
      volumeClaimTemplate: null
    enableUserWorkload: null
    grafana:
      nodeSelector: null
      tolerations: null
    http: null
    k8sPrometheusAdapter: null
    kubeStateMetrics: null
    openshiftStateMetrics: null
    prometheusK8s:
      additionalAlertManagerConfigs:
      - apiVersion: v2
        bearerToken:
          key: token
          name: observability-alertmanager-accessor
        pathPrefix: /
        scheme: https
        staticConfigs:
        - alertmanager-open-cluster-management-observability.apps.bm-stage.rdu2.scalelab.redhat.com
        tlsConfig:
          ServerName: ""
          ca:
            key: service-ca.crt
            name: hub-alertmanager-router-ca
          insecureSkipVerify: false
      externalLabels:
        cluster: ccf6f406-53d9-4232-894d-db02695cf189
      logLevel: ""
      nodeSelector: null
      remoteWrite: null
      resources: null
      retention: 24h
      tolerations: null
      volumeClaimTemplate: null
    prometheusOperator: null
    telemeterClient: null
    thanosQuerier: null
kind: ConfigMap
metadata:
  creationTimestamp: "2022-09-01T18:52:16Z"
  name: cluster-monitoring-config
  namespace: openshift-monitoring
  resourceVersion: "263306"
  uid: f80d71cd-b3ba-43c7-b8b4-078f4a87d3c4

Expected results:

apiVersion: v1
kind: ConfigMap
metadata:
  name: cluster-monitoring-config
  namespace: openshift-monitoring
  annotations:
    ran.openshift.io/ztp-deploy-wave: "1"
data:
  config.yaml: |
    grafana:
      enabled: false
    alertmanagerMain:
      enabled: false
    prometheusK8s:
       retention: 24h

Additional info:

duplicates

OCPBUGS-1025 [tracker]cluster-monitoring-config race condition between Observability and du profile

ON_QA

Assignee:: Simon Pasquier

Reporter:: Alex Krzos

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2022/09/02 5:35 PM

Updated:: 2024/09/25 5:44 PM

Resolved:: 2022/09/14 2:23 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates