Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-49398

the actual issue was identified as a pending state for the volume mount for "csivol-fa6fcead42" in all the affected pods. Nine pods were reported to have experienced FailedMount errors, but only three of them existed currently.

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done-Errata
    • Icon: Critical Critical
    • 4.13.z
    • 4.12.z
    • RHCOS
    • Critical
    • None
    • True
    • Hide

      Due to this bug, nfs mount is getting stuck over the worker nodes and CU needs to reboot it every-time.

      After reboot mount works fine for some time but then when we have workload moved on the node, the bug hits again. 

      Because of this CU needs to reboot 4-5 nodes on the daily basis in the middle of their migration.

      Show
      Due to this bug, nfs mount is getting stuck over the worker nodes and CU needs to reboot it every-time. After reboot mount works fine for some time but then when we have workload moved on the node, the bug hits again.  Because of this CU needs to reboot 4-5 nodes on the daily basis in the middle of their migration.
    • Hide
      * Previously, pods could get stuck when generating `FailedMount` errors. These errors caused nodes to need additional reboots and for the Network File System (NFS) volume mount to stay in a pending state. With this release, a kernel update fixes the issue so that pods no longer get stuck because nodes no longer need to be rebooted to clear the `FailedMount` errors. (link:https://issues.redhat.com/browse/OCPBUGS-49398[*OCPBUGS-49398*])
      Show
      * Previously, pods could get stuck when generating `FailedMount` errors. These errors caused nodes to need additional reboots and for the Network File System (NFS) volume mount to stay in a pending state. With this release, a kernel update fixes the issue so that pods no longer get stuck because nodes no longer need to be rebooted to clear the `FailedMount` errors. (link: https://issues.redhat.com/browse/OCPBUGS-49398 [* OCPBUGS-49398 *])
    • Bug Fix
    • Done
    • Hide
      Customer is in the middle of the migration and got blocked due to this bug.
      It is impacting their production environment and this cluster is hosting charging application which is their main revenue generating application.
      Show
      Customer is in the middle of the migration and got blocked due to this bug. It is impacting their production environment and this cluster is hosting charging application which is their main revenue generating application.

      Description of problem:

          the actual issue was identified as a pending state for the volume mount for "csivol-fa6fcead42" in all the affected pods. Nine pods were reported to have experienced FailedMount errors, but only three of them existed currently.
      
      ## This issue is blocked due to the following bug
      https://bugzilla.redhat.com/show_bug.cgi?id=2061259

      Version-Release number of selected component (if applicable):

          

      How reproducible:

          This is a bug. Fixed is already identified in 8.8

      Steps to Reproduce:

          1.
          2.
          3.
          

      Actual results:

          

      Expected results:

          

      Additional info:

          

              Unassigned Unassigned
              rh-ee-bnazkani Bhuwan Nazkani
              Michael Nguyen Michael Nguyen
              rhel-sst-kernel-livepatching
              Votes:
              0 Vote for this issue
              Watchers:
              15 Start watching this issue

                Created:
                Updated:
                Resolved: