  1. OpenShift Bugs
  2. OCPBUGS-35917

[release-4.14] LVMS: lvmd has no health checks


    • Important
    • No
    • 3
    • OCPEDGE Sprint 256
    • 1
    • False
    • The lvmd container in the topolvm-node DaemonSet was missing a health probe configuration.
    • Bug Fix
    • In Progress

      This is an LVMS Bug Report:

      Please create and attach a must-gather as indicated by this Guide to collect LVMS-relevant data from the cluster (the link points to the latest version; use older versions of the documentation for older OCP releases as applicable).

      Please make sure that you describe your storage configuration in detail. List all devices that you plan to work with for LVMS as well as any relevant machine configuration data to make it easier for an engineer to help out.

      Description of problem:

      Because lvmd has no health checks, it cannot recover automatically if it gets stuck, for example because of networking issues.

      Version-Release number of selected component (if applicable):

       4.14.z

      Steps to Reproduce:

      It is hard to reproduce, but we saw this happening in some clusters with network issues. 

      Actual results:

      lvmd gets stuck and is not restarted.

      Expected results:

      lvmd gets restarted and continues functioning.    

      Additional info:

      TopoLVM Node already has a CSI probe that we can copy into lvmd.
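
To illustrate the kind of check a liveness probe relies on, here is a minimal sketch, not the actual lvmd or TopoLVM implementation. It assumes a hypothetical `check` callable that exercises the daemon, and treats the daemon as unhealthy if the check raises or does not return within a timeout. The names `check_with_timeout` and `make_health_handler`, the `/healthz` path, and the choice of Python are all assumptions made for illustration only.

```python
import threading
from http.server import BaseHTTPRequestHandler


def check_with_timeout(check, timeout_s):
    """Return True only if check() finishes without raising within timeout_s seconds."""
    result = {}

    def worker():
        try:
            check()
            result["ok"] = True
        except Exception:
            result["ok"] = False

    # A daemon thread lets the process exit even if check() never returns.
    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    thread.join(timeout_s)
    # If the worker is still running, "ok" was never set: report unhealthy.
    return result.get("ok", False)


def make_health_handler(check, timeout_s=1.0):
    """Build an HTTP handler class that answers GET /healthz (hypothetical path)."""

    class HealthHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            if self.path != "/healthz":
                self.send_error(404)
                return
            healthy = check_with_timeout(check, timeout_s)
            # 200 tells the prober the daemon is alive; 503 counts as a failed probe.
            self.send_response(200 if healthy else 503)
            self.end_headers()

        def log_message(self, *args):
            # Keep the sketch quiet; a real service would log probe failures.
            pass

    return HealthHandler
```

A handler built this way could be served with `http.server.HTTPServer` and targeted by an HTTP liveness probe. After enough consecutive failed probes, the kubelet restarts the container, which is the auto-recovery described under "Expected results".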

            rh-ee-jmoller Jakob Moeller
            sakbas@redhat.com Suleyman Akbas
            Minal Pradeep Makwana
            Votes: 0
            Watchers: 5

              Created:
              Updated:
              Resolved: