  1. OpenShift Bugs
  2. OCPBUGS-49347

Degraded machine-config CO due to ManagedBootImages update when upgrading an OCL cluster


    • Moderate
    • No
    • 2
    • MCO Sprint 266, MCO Sprint 267
    • 2
    • False
    • Release Note Not Required
    • In Progress

      Description of problem:

      When we upgrade an OCL cluster from one 4.18 build to another (here, from 4.18.0-rc.6 to a 4.18 nightly) and a machineset is configured so that its base cloud image is updated automatically during the upgrade, the machine-config CO may become degraded with this message:
      
        - lastTransitionTime: "2025-01-24T19:58:23Z"
          message: 'Unable to apply 4.18.0-0.nightly-2025-01-24-014549: bootimage update
            failed: 1 Degraded MAPI MachineSets | 0 Degraded CAPI MachineSets | 0 CAPI MachineDeployments
            | Error(s): error syncing MAPI MachineSet cloned-tc-70813-label: timed out waiting
            for coreos-bootimages config map: mismatch between MCO hash version stored in
            configmap and current MCO version; sync will exit to wait for the MCO upgrade
            to complete'
          reason: MachineConfigurationFailed
          status: "True"
          type: Degraded
      
      Link to the prow execution: https://qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/view/gs/qe-private-deck/logs/periodic-ci-openshift-openshift-tests-private-release-4.18-amd64-nightly-4.18-upgrade-from-stable-4.18-aws-ipi-ocl-fips-f60/1882788481910968320
      
      
          

      Version-Release number of selected component (if applicable):

      Upgrading from 4.18.0-rc.6 to 4.18.0-0.nightly-2025-01-24-014549
          

      How reproducible:

      Rarely
          

      Steps to Reproduce:

          1. Install 4.18.0-rc.6
          2. Clone an existing machineset
          3. Configure the new machineset so that its base cloud image is updated automatically during the upgrade. Use a label-based selection so that only this machineset is updated.
          4. Upgrade to 4.18.0-0.nightly-2025-01-24-014549
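
The label-based opt-in from step 3 might look roughly like the fragment below. This is a minimal sketch, not the configuration used in the failing run: the label `update-boot-image: "true"` is hypothetical and would also have to be set on the cloned machineset. The actual label and selector used by the test are not recorded in this report.

```yaml
# Sketch only: opt a single MAPI MachineSet into boot image updates by label.
apiVersion: operator.openshift.io/v1
kind: MachineConfiguration
metadata:
  name: cluster
spec:
  managedBootImages:
    machineManagers:
    - resource: machinesets
      apiGroup: machine.openshift.io
      selection:
        mode: Partial            # only machinesets matching the selector below
        partial:
          machineResourceSelector:
            matchLabels:
              update-boot-image: "true"   # hypothetical label, assumed for illustration
```

With `mode: Partial`, machinesets that do not carry the label are left untouched, which matches the intent of updating only the cloned machineset.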
          

      Actual results:

      
      The machine-config CO is degraded with this message:
      
        - lastTransitionTime: "2025-01-24T19:58:23Z"
          message: 'Unable to apply 4.18.0-0.nightly-2025-01-24-014549: bootimage update
            failed: 1 Degraded MAPI MachineSets | 0 Degraded CAPI MachineSets | 0 CAPI MachineDeployments
            | Error(s): error syncing MAPI MachineSet cloned-tc-70813-label: timed out waiting
            for coreos-bootimages config map: mismatch between MCO hash version stored in
            configmap and current MCO version; sync will exit to wait for the MCO upgrade
            to complete'
          reason: MachineConfigurationFailed
          status: "True"
          type: Degraded
          

      Expected results:

      
          

      Additional info:

      It looks like the coreos-bootimages configmap was never updated with the new MCOVersionHash.
      
      It may not be related to OCL at all.
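
For orientation, the field in question sits in the data of the coreos-bootimages configmap. The fragment below is a sketch with a placeholder value, assuming the configmap lives in the openshift-machine-config-operator namespace; the other keys are omitted, and no real hash from the affected cluster is shown because none is recorded in this report.

```yaml
# Sketch only: where MCOVersionHash is stored. Value is a placeholder.
apiVersion: v1
kind: ConfigMap
metadata:
  name: coreos-bootimages
  namespace: openshift-machine-config-operator
data:
  MCOVersionHash: "<hash written by the MCO build that last updated this configmap>"
  # other keys omitted
```

Going by the error message, the boot image sync compares this stored hash with the hash of the running MCO and exits while they differ. If the value still belongs to the pre-upgrade build after the upgrade has finished, the wait times out and the machineset is reported as degraded.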
      
          

              djoshy David Joshy
              sregidor@redhat.com Sergio Regidor de la Rosa
              Sergio Regidor de la Rosa Sergio Regidor de la Rosa