Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-59197

Image Mode. Pool degraded if the MOSC resource is deleted before the osImage is fully applied

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Moderate
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      If we enable ImageMode in a MCP, and we remove the MOSC resource before the new osImage is fully applied, the MCP becomes degraded reporting this error:
      
        - lastTransitionTime: "2025-07-10T12:02:42Z"
          message: 'Node ip-10-0-11-240.us-east-2.compute.internal is reporting: "Node ip-10-0-11-240.us-east-2.compute.internal
            upgrade failure. failed to update OS to image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image@sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8:
            error running rpm-ostree rebase --experimental ostree-unverified-registry:image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image@sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8:
            error: Creating importer: failed to invoke method OpenImage: failed to invoke
            method OpenImage: reading manifest sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8
            in image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image:
            manifest unknown\n: exit status 1", Node ip-10-0-11-240.us-east-2.compute.internal
            is reporting: "failed to update OS to image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image@sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8:
            error running rpm-ostree rebase --experimental ostree-unverified-registry:image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image@sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8:
            error: Creating importer: failed to invoke method OpenImage: failed to invoke
            method OpenImage: reading manifest sha256:01971adebd6336f2da912840b92aac769a452a2830d34e420a58965cd77da5b8
            in image-registry.openshift-image-registry.svc:5000/openshift-machine-config-operator/ocb-image:
            manifest unknown\n: exit status 1"'
          reason: 1 nodes are reporting degraded status on sync
          status: "True"
          type: NodeDegraded
      
      
      

      Version-Release number of selected component (if applicable):

      4.20
      

      How reproducible:

      Always
      

      Steps to Reproduce:

      
      1. Create a MOSC resource to enable ImageMode in the worker pool
      2. Wait until the MOSB finishes building the new osImage
      3. Wait until the first node  starts updating
      4. Immediately remove the MOSC resource
      
      

      Actual results:

      
      The MOSC resource will garbage collect the existing MOSBs and while doing that it will delete their osImages from the image registry.
      
      Since the osImage is gabage collected, the MCP which is updating the image in the nodes cannot properly apply it and it becomes degraded.
      
      

      Expected results:

      
      I'm not sure about what should be the expected behaviour when we remove the MOSC resource while the images are being applied. Nevertheless, we should avoid a degradation.
      
      

      Additional info:

      
      
      

              umohnani Urvashi Mohnani
              sregidor@redhat.com Sergio Regidor de la Rosa
              None
              None
              Sergio Regidor de la Rosa Sergio Regidor de la Rosa
              None
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated: