Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-44780

[OCP 4.16] crio-wipe failed to clean storage directory /var/lib/containers/storage on boot

XMLWordPrintable

    • None
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      After a non-clean reboot, during the boot process and subsequent trigger of crio-wipe, crio was unable to clean the storage:
      
      ~~~
      Nov 06 04:21:02 XXXXXXXXXXXXXXXXX crio[2987]: time="2024-11-06 04:21:02.567534481Z" level=info msg="Starting CRI-O, version: 1.29.9-2.rhaos4.16.git933bdd2.el9, git: unknown(clean)"
      Nov 06 04:21:02 XXXXXXXXXXXXXXXXX crio[2987]: time="2024-11-06 04:21:02.738954676Z" level=info msg="Checking whether cri-o should wipe containers: open /var/run/crio/version: no such file or directory"
      Nov 06 04:21:02 XXXXXXXXXXXXXXXXX crio[2987]: time="2024-11-06 04:21:02.739010978Z" level=info msg="File /var/lib/crio/clean.shutdown not found. Wiping storage directory /var/lib/containers/storage because of suspected dirty shutdown"
      ...
      Nov 06 04:21:45 XXXXXXXXXXXXXXXXX crio[2987]: time="2024-11-06 04:21:45.575787172Z" level=fatal msg="failed to remove storage directory: unlinkat /var/lib/containers/storage: directory not empty"
      Nov 06 04:21:45 XXXXXXXXXXXXXXXXX systemd[1]: crio-wipe.service: Main process exited, code=exited, status=1/FAILURE
      Nov 06 04:21:45 XXXXXXXXXXXXXXXXX systemd[1]: crio-wipe.service: Failed with result 'exit-code'.
      Nov 06 04:21:45 XXXXXXXXXXXXXXXXX systemd[1]: Failed to start CRI-O Auto Update Script.
      Nov 06 04:21:45 XXXXXXXXXXXXXXXXX systemd[1]: crio-wipe.service: Consumed 18.020s CPU time.
      ~~~

      Version-Release number of selected component (if applicable):

      Openshift 4.16.16
      cri-o 1.29.9-2.rhaos4.16.git933bdd2.el9

      How reproducible:

      N/A

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

      crio-wipe failed to clear storage

      Expected results:

      crio-wipe clear the storage

      Additional info:

      This was related with a situation in which one pod has incorrect permissions on his storage volume and we want to clarify this issue as a possible root cause, also try to understand, if possible, why it happened.

       

              rh-ee-kwilczyn Krzysztof Wilczyński
              rhn-support-mabajodu Mario Abajo Duran
              Sunil Choudhary Sunil Choudhary
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated: