Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-48316

Node reboot fails due to conflicting OSTree finalize staged transactions

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • 4.16.z, 4.18
    • RHCOS
    • Important
    • None
    • Proposed
    • x86_64
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      While applying kernel argument changes using the Machine Config Daemon (MCD) on worker node worker0, the node reboot process failed due to conflicting systemd transactions related to ostree-finalize-staged. The failure occurred during the execution of the machine-config-daemon-reboot.service.

      Version-Release number of selected component (if applicable):

      NAME      VERSION   AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.16.27   True        False         26m     Error while reconciling 4.16.27: the cluster operator machine-config is degraded

      How reproducible:

          

      Steps to Reproduce:

          1. Execute a machine configuration update involving kernel arguments during the RHACM initial deployment.
          2. Observe the logs of the Machine Config Daemon and systemd during the reboot process. 
          3. Notice the conflicting transactions for ostree-finalize-staged.     

      Actual results:

         Logs indicate:bashCopy codeRequested transaction contradicts existing jobs: Transaction for ostree-finalize-staged.service/start is destructive (ostree-finalize-staged-hold.service has 'stop' job queued, but 'start' is included in transaction).
      The node fails to reboot properly.The ostree-finalize-staged.path job fails with result 'resources'.

      Expected results:

      Node reboots successfully into the new configuration without transactional conflicts.
      No errors in the systemd services related to ostree-finalize-staged.

      Additional info:

          Jan 08 00:28:44 worker0 systemd[1]: Requested transaction contradicts existing jobs: Transaction for ostree-finalize-staged.service/start is destructive (ostree-finalize-staged-hold.service has 'stop' job queued, but 'start' is included in transaction). Jan 08 00:28:44 worker0 systemd[1]: ostree-finalize-staged.path: Failed to queue unit startup job: Transaction for ostree-finalize-staged.service/start is destructive. Jan 08 00:28:44 worker0 systemd[1]: ostree-finalize-staged.path: Failed with result 'resources'. 

      More detials outlined in here: https://docs.google.com/document/d/1ywZ-Is2WgzdgtMGtDCn0XyHLJ9m4Pucr90KK70E0Oew/edit?tab=t.0 

              Unassigned Unassigned
              midu@redhat.com Mihai IDU
              Sergio Regidor de la Rosa Sergio Regidor de la Rosa
              Votes:
              0 Vote for this issue
              Watchers:
              15 Start watching this issue

                Created:
                Updated:
                Resolved: