-
Bug
-
Resolution: Done-Errata
-
Normal
-
4.18
-
Quality / Stability / Reliability
-
False
-
-
2
-
Moderate
-
No
-
None
-
None
-
MCO Sprint 266, MCO Sprint 267
-
2
-
In Progress
-
Release Note Not Required
-
None
-
None
-
None
-
None
-
None
Description of problem:
When we upgrade an OCL cluster from 4.18 -> 4.18, and we configure a machineset so that its base cloud Image is automatically updated in the upgrade, the machine-config CO may become degraded with this message:
- lastTransitionTime: "2025-01-24T19:58:23Z"
message: 'Unable to apply 4.18.0-0.nightly-2025-01-24-014549: bootimage update
failed: 1 Degraded MAPI MachineSets | 0 Degraded CAPI MachineSets | 0 CAPI MachineDeployments
| Error(s): error syncing MAPI MachineSet cloned-tc-70813-label: timed out waiting
for coreos-bootimages config map: mismatch between MCO hash version stored in
configmap and current MCO version; sync will exit to wait for the MCO upgrade
to complete'
reason: MachineConfigurationFailed
status: "True"
type: Degraded
Link to the prow execution: https://qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/view/gs/qe-private-deck/logs/periodic-ci-openshift-openshift-tests-private-release-4.18-amd64-nightly-4.18-upgrade-from-stable-4.18-aws-ipi-ocl-fips-f60/1882788481910968320
Version-Release number of selected component (if applicable):
Upgrading from 4.18.0-rc.6 to 4.18.0-0.nightly-2025-01-24-014549
How reproducible:
Rarely
Steps to Reproduce:
1. install 4.18.0-rc.6
2. Clone an existing machineset
3. Configure the new machineset so that its base cloud image is updated automatically in the upgraded. Use a label configuration, so that only this machineset is updated.
4. Upgrade to 4.18.0-0.nightly-2025-01-24-014549
Actual results:
The machine-config CO is degraded with this message:
- lastTransitionTime: "2025-01-24T19:58:23Z"
message: 'Unable to apply 4.18.0-0.nightly-2025-01-24-014549: bootimage update
failed: 1 Degraded MAPI MachineSets | 0 Degraded CAPI MachineSets | 0 CAPI MachineDeployments
| Error(s): error syncing MAPI MachineSet cloned-tc-70813-label: timed out waiting
for coreos-bootimages config map: mismatch between MCO hash version stored in
configmap and current MCO version; sync will exit to wait for the MCO upgrade
to complete'
reason: MachineConfigurationFailed
status: "True"
type: Degraded
Expected results:
Additional info:
It looks like the coreos-bootimages configma was never updated with the new MCOVersionHash
It may not be related to OCL at all.
- links to
-
RHEA-2024:11038
OpenShift Container Platform 4.19.z bug fix update