OpenShift Bugs / OCPBUGS-57504

CBO may show as Degraded briefly during Metal3 initialization

    • Quality / Stability / Reliability
    • False
    • None
    • Important
    • None
    • None
    • Proposed
    • Metal Platform 272, Metal Platform 273
    • 2
    • Done
    • Release Note Not Required
    • N/A

      This is a clone of issue OCPBUGS-56723. The following is the description of the original issue:

      There is an issue in the status logic at https://github.com/openshift/cluster-baremetal-operator/blob/2a6cf5336fd7aafa8df59331cfb12c968d104c64/controllers/clusteroperator.go#L242-L260: the status ReasonResourceNotFound is not a fatal error but a temporary condition that holds while the Metal3 pod does not exist yet. Setting the operator's status to Degraded is wrong, since nothing bad is happening. We should set Degraded=False, Progressing=True, Available=False instead.
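
      The proposed mapping can be illustrated with a minimal, self-contained Go sketch. It is not the operator's actual code: the types, the string value of ReasonResourceNotFound, the ReasonDeployFailed reason, and the conditionsFor helper are all hypothetical and exist only to show the intended outcome. The handling of other reasons (Degraded=True) is likewise an assumption made for contrast.

```go
package main

import "fmt"

// ConditionStatus mirrors the True/False values of ClusterOperator conditions.
type ConditionStatus string

const (
	ConditionTrue  ConditionStatus = "True"
	ConditionFalse ConditionStatus = "False"
)

// Reason is a hypothetical stand-in for the operator's status reasons.
type Reason string

const (
	// The name comes from the issue; the string value is an assumption.
	ReasonResourceNotFound Reason = "ResourceNotFound"
	// Hypothetical fatal reason, used only for contrast.
	ReasonDeployFailed Reason = "DeployFailed"
)

// Conditions groups the three statuses discussed in the issue.
type Conditions struct {
	Degraded    ConditionStatus
	Progressing ConditionStatus
	Available   ConditionStatus
}

// conditionsFor is a hypothetical helper that maps a reason to conditions.
func conditionsFor(r Reason) Conditions {
	switch r {
	case ReasonResourceNotFound:
		// Temporary state: the Metal3 pod does not exist yet, so the
		// operator is still progressing rather than degraded.
		return Conditions{
			Degraded:    ConditionFalse,
			Progressing: ConditionTrue,
			Available:   ConditionFalse,
		}
	default:
		// Assumption: any other reason is treated as a real failure.
		return Conditions{
			Degraded:    ConditionTrue,
			Progressing: ConditionFalse,
			Available:   ConditionFalse,
		}
	}
}

func main() {
	fmt.Printf("%+v\n", conditionsFor(ReasonResourceNotFound))
	fmt.Printf("%+v\n", conditionsFor(ReasonDeployFailed))
}
```

      The point of the sketch is only that the "resource not found" case gets its own branch, so a missing Metal3 pod during initialization is not reported as Degraded.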

      Spotted in https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_baremetal-operator/418/pull-ci-openshift-baremetal-operator-main-e2e-metal-ipi-serial-ipv4/1927125756081606656. It may be tricky to reproduce manually; one option might be to watch the cluster operators' status and the associated error while the installer is still running.

              rhn-engineering-hpokorny Honza Pokorny
              openshift-crt-jira-prow OpenShift Prow Bot
              None
              None
              Steeve Goveas Steeve Goveas
              None