Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-76859

UPI PXE installs fail during node-image-pull on RHCOS 4.20.11+ due to tmpfs exhaustion at /var/ostree-container

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • 4.20.z
    • None
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • Important
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      During UPI network install on UPI cluster, the installation fails consistently on RHCOS 4.20.11 and 4.20.13 when node-image-pull.service attempts to import the release image via ostree.
      
      Journal shows:
      Importing regfile: min-free-space-percent '3%' would be exceeded Filesystem:
      tmpfs /var/ostree-container ~4GB, ~96% used Increasing tmpfs size allows the install to complete successfully.

      How reproducible:

      100% on:4.20.11 4.20.13
      Not reproducible on:4.20.0
      
      

      Steps to Reproduce:

      Behavior does not reproduce on RHCOS 4.20.0.

      This suggests a regression in: RHCOS live environment tmpfs sizing, payload size assumptions or ostree import behavior.

      Actual results:

      New clusters cannot be installed using latest 4.20.z images unless:
      
      tmpfs manually resized, OR
      images pinned to 4.20.0

      Expected results:

      It should include without manual intervention    

      Additional info:

          Case: https://access.redhat.com/support/cases/#/case/04376952 

              Unassigned Unassigned
              rhn-support-ksuthar Komal Suthar
              Jad Haj Yahya Jad Haj Yahya
              None
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated: