Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-18822

pod with sr-iov VF stuck in init state and can not start due to driver error. Sporadic

XMLWordPrintable

    • Moderate
    • No
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      We have sporadic failures in our SR-IOV suites. Few test cases failed due to
       
      ERRORED: error configuring pod [sriov-operator-tests/testpod-prlx7] networking: [sriov-operator-tests/testpod-prlx7/f1643ebb-6ae8-40fd-b695-22f4e83e32a4:test-sriov-static-jumbo]: error adding container to network "test-sriov-static-jumbo": SRIOV-CNI failed to load netconf: LoadConf(): the VF 0000:86:01.5 does not have a interface name or a dpdk driver
      
      
      
      oc describe pod shows>
      
      
      Warning  FailedCreatePodSandBox  10s kubelet  Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_testpod-prlx7_sriov-operator-tests_f1643ebb-6ae8-40fd-b695-22f4e83e32a4_0(fb1a23028fb7b1d6f9ee923605f1dc3be8171bf31ec6e526dfb9cc48d1c10f2a): error adding pod sriov-operator-tests_testpod-prlx7 to CNI network "multus-cni-network": plugin type="multus-shim" name="multus-cni-network" failed (add): CmdAdd (shim): CNI request failed with status 400: '&{ContainerID:fb1a23028fb7b1d6f9ee923605f1dc3be8171bf31ec6e526dfb9cc48d1c10f2a Netns:/var/run/netns/b9d3f18a-5ef0-4aae-9d02-1e9654693112 IfName:eth0 Args:IgnoreUnknown=1;K8S_POD_NAMESPACE=sriov-operator-tests;K8S_POD_NAME=testpod-prlx7;K8S_POD_INFRA_CONTAINER_ID=fb1a23028fb7b1d6f9ee923605f1dc3be8171bf31ec6e526dfb9cc48d1c10f2a;K8S_POD_UID=f1643ebb-6ae8-40fd-b695-22f4e83e32a4 Path: StdinData:[123 34 98 105 110 68 105 114 34 58 34 47 118 97 114 47 108 105 98 47 99 110 105 47 98 105 110 34 44 34 99 108 117 115 116 101 114 78 101 116 119 111 114 107 34 58 34 47 104 111 115 116 47 114 117 110 47 109 117 108 116 117 115 47 99 110 105 47 110 101 116 46 100 47 49 48 45 111 118 110 45 107 117 98 101 114 110 101 116 101 115 46 99 111 110 102 34 44 34 99 110 105 86 101 114 115 105 111 110 34 58 34 48 46 51 46 49 34 44 34 100 97 101 109 111 110 83 111 99 107 101 116 68 105 114 34 58 34 47 114 117 110 47 109 117 108 116 117 115 47 115 111 99 107 101 116 34 44 34 103 108 111 98 97 108 78 97 109 101 115 112 97 99 101 115 34 58 34 100 101 102 97 117 108 116 44 111 112 101 110 115 104 105 102 116 45 109 117 108 116 117 115 44 111 112 101 110 115 104 105 102 116 45 115 114 105 111 118 45 110 101 116 119 111 114 107 45 111 112 101 114 97 116 111 114 34 44 34 108 111 103 76 101 118 101 108 34 58 34 118 101 114 98 111 115 101 34 44 34 108 111 103 84 111 83 116 100 101 114 114 34 58 116 114 117 101 44 34 110 97 109 101 34 58 34 109 117 108 116 117 115 45 99 110 105 45 110 101 116 119 111 114 107 34 44 34 110 97 109 101 115 112 97 99 101 73 115 111 108 97 116 105 111 110 34 58 116 114 117 101 44 34 114 101 97 100 105 110 101 115 115 105 110 100 105 99 97 116 111 114 102 105 108 101 34 58 34 47 104 111 115 116 47 114 117 110 47 109 117 108 116 117 115 47 99 110 105 47 110 101 116 46 100 47 49 48 45 111 118 110 45 107 117 98 101 114 110 101 116 101 115 46 99 111 110 102 34 44 34 116 121 112 101 34 58 34 109 117 108 116 117 115 45 115 104 105 109 34 44 10 32 32 32 32 34 99 110 105 86 101 114 115 105 111 110 34 58 32 34 48 46 51 46 49 34 44 10 32 32 32 32 34 99 104 114 111 111 116 68 105 114 34 58 32 34 47 104 111 115 116 114 111 111 116 34 44 10 32 32 32 32 34 108 111 103 84 111 83 116 100 101 114 114 34 58 32 116 114 117 101 44 10 32 32 32 32 34 108 111 103 76 101 118 101 108 34 58 32 34 118 101 114 98 111 115 101 34 44 10 32 32 32 32 34 98 105 110 68 105 114 34 58 32 34 47 118 97 114 47 108 105 98 47 99 110 105 47 98 105 110 34 44 10 32 32 32 32 34 99 110 105 67 111 110 102 105 103 68 105 114 34 58 32 34 47 104 111 115 116 47 101 116 99 47 99 110 105 47 110 101 116 46 100 34 44 10 32 32 32 32 34 109 117 108 116 117 115 67 111 110 102 105 103 70 105 108 101 34 58 32 34 97 117 116 111 34 44 10 32 32 32 32 34 109 117 108 116 117 115 65 117 116 111 99 111 110 102 105 103 68 105 114 34 58 32 34 47 104 111 115 116 47 114 117 110 47 109 117 108 116 117 115 47 99 110 105 47 110 101 116 46 100 34 44 10 32 32 32 32 34 110 97 109 101 115 112 97 99 101 73 115 111 108 97 116 105 111 110 34 58 32 116 114 117 101 44 10 32 32 32 32 34 103 108 111 98 97 108 78 97 109 101 115 112 97 99 101 115 34 58 32 34 100 101 102 97 117 108 116 44 111 112 101 110 115 104 105 102 116 45 109 117 108 116 117 115 44 111 112 101 110 115 104 105 102 116 45 115 114 105 111 118 45 110 101 116 119 111 114 107 45 111 112 101 114 97 116 111 114 34 44 10 32 32 32 32 34 114 101 97 100 105 110 101 115 115 105 110 100 105 99 97 116 111 114 102 105 108 101 34 58 32 34 47 104 111 115 116 47 114 117 110 47 109 117 108 116 117 115 47 99 110 105 47 110 101 116 46 100 47 49 48 45 111 118 110 45 107 117 98 101 114 110 101 116 101 115 46 99 111 110 102 34 44 10 32 32 32 32 34 100 97 101 109 111 110 83 111 99 107 101 116 68 105 114 34 58 32 34 47 114 117 110 47 109 117 108 116 117 115 47 115 111 99 107 101 116 34 44 10 32 32 32 32 34 115 111 99 107 101 116 68 105 114 34 58 32 34 47 104 111 115 116 47 114 117 110 47 109 117 108 116 117 115 47 115 111 99 107 101 116 34 10 125 10]} ContainerID:"fb1a23028fb7b1d6f9ee923605f1dc3be8171bf31ec6e526dfb9cc48d1c10f2a" Netns:"/var/run/netns/b9d3f18a-5ef0-4aae-9d02-1e9654693112" IfName:"eth0" Args:"IgnoreUnknown=1;K8S_POD_NAMESPACE=sriov-operator-tests;K8S_POD_NAME=testpod-prlx7;K8S_POD_INFRA_CONTAINER_ID=fb1a23028fb7b1d6f9ee923605f1dc3be8171bf31ec6e526dfb9cc48d1c10f2a;K8S_POD_UID=f1643ebb-6ae8-40fd-b695-22f4e83e32a4" Path:"" ERRORED: error configuring pod [sriov-operator-tests/testpod-prlx7] networking: [sriov-operator-tests/testpod-prlx7/f1643ebb-6ae8-40fd-b695-22f4e83e32a4:test-sriov-static-jumbo]: error adding container to network "test-sriov-static-jumbo": SRIOV-CNI failed to load netconf: LoadConf(): the VF 0000:86:01.5 does not have a interface name or a dpdk driver
      
      

      Version-Release number of selected component (if applicable):

      sriov-network-operator.v4.14.0-202308242104   SR-IOV Network Operator   4.14.0-202308242104              Succeeded
      
      Kustomize Version: v4.5.4
      Server Version: 4.14.0-rc.0
      Kubernetes Version: v1.27.4+2c83a9f
      

      How reproducible:

      The issue is sporadic and occurs rarely 1-2 times out of 100 test cases.

      Steps to Reproduce:

      1. Run full CNF sr-iov test suite (about 500 test cases)
      2. 5-7 will fail due to this issue
      

      Actual results:

      Few test failed

      Expected results:

      All test passed

      Additional info:

      QE is going to collect additional logs when the issue occur.

              bnemeth@redhat.com Balazs Nemeth
              nkononov@redhat.com Nikita Kononov
              Zhanqi Zhao Zhanqi Zhao
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: