Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-14605

CNV regression with recent Kubernetes rebase - device plugin

XMLWordPrintable

    • Critical
    • No
    • CNF Compute Sprint 239, CNF Compute Sprint 240, CNF Compute Sprint 241, CNF Compute Sprint 242, CNF Compute Sprint 243, CNF Compute Sprint 244, CNF Compute Sprint 245
    • 7
    • Rejected
    • False
    • Hide

      None

      Show
      None
    • Hide
      20231030: GREEN but trending YELLOW: QA validation ongoing. Trending yellow because of challenges to set up SRIOV on 4.15
      20231003: GREEN but trending YELLOW: QA validation ongoing. Trending yellow because of challenges to set up SRIOV.
      20230919: GREEN OCP team likely to pull patch version from septembers (including the fixes) but final decision still pending (capacity, other prios)

      Show
      20231030: GREEN but trending YELLOW: QA validation ongoing. Trending yellow because of challenges to set up SRIOV on 4.15 20231003: GREEN but trending YELLOW: QA validation ongoing. Trending yellow because of challenges to set up SRIOV. 20230919: GREEN OCP team likely to pull patch version from septembers (including the fixes) but final decision still pending (capacity, other prios)

      Description of problem:

      Pods are being terminated on Kubelet restart if they consume any device.
      
      In case of CNV this Pods are carrying VMs and the assuption is that Kubelet will not terminate the Pod in this case.

      Version-Release number of selected component (if applicable):

      4.14 / 4.13.z / 4.12.z

      How reproducible:

      This should be reproducable with any device plugin as far as goes my understanding

      Steps to Reproduce:

      1. Create Pod requesting device plugin
      2. Restart Kubelet
      3.
      

      Actual results:

      Admission error -> Pod terminates

      Expected results:

      No error -> Existing & Running Pods will continue running after Kubelet restart

      Additional info:

      The culprit seems to be https://github.com/kubernetes/kubernetes/pull/116376

            fromani@redhat.com Francesco Romani
            lpivarc Luboslav Pivarc
            Sunil Choudhary Sunil Choudhary
            Votes:
            1 Vote for this issue
            Watchers:
            30 Start watching this issue

              Created:
              Updated:
              Resolved: