Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-22373

VRF OCP Primary network overlap traffic failure on Intel X710 SRIOV interface

    • Critical
    • No
    • CNF Network Sprint 245, CNF Network Sprint 246
    • 2
    • False
    • Hide

      None

      Show
      None
    • 10/17 - A mitigation from the openshift side has been implemented u/s and on 4.15. RCA still in progress on kernel side

      This is a clone of issue OCPBUGS-21831. The following is the description of the original issue:

      This is a clone of issue OCPBUGS-19536. The following is the description of the original issue:

      Description of problem:

      When running the cnf-feature-deploy VRF test case withover lapping IPs on the OCP primary interface traffic does not flow over any of the secondary interfaces VRF and non-VRF. This test case fails on the Intel X810 card but passes on the Mellanox Connect-X5 card. 

      Version-Release number of selected component (if applicable):

       

      How reproducible:

      Easily

      Steps to Reproduce:

      1. Deploy OCP 4.14-rc1 on BM cluster with SRIOV Intel X710 card
      2. Run VRF Overlapping CNF-test
      

      Actual results:

        [FAILED] Unexpected error:      <*fmt.wrapError | 0xc0005d24e0>:       remote command [ping -I red -c5 10.128.2.194] error [command terminated with exit code 1]. output [ping: Warning: source address might be selected on device other than red.      PING 10.128.2.194 (10.128.2.194) from 10.128.2.193 red: 56(84) bytes of data.      From 10.128.2.193 icmp_seq=1 Destination Host Unreachable      From 10.128.2.193 icmp_seq=2 Destination Host Unreachable      From 10.128.2.193 icmp_seq=3 Destination Host Unreachable      From 10.128.2.193 icmp_seq=4 Destination Host Unreachable      From 10.128.2.193 icmp_seq=5 Destination Host Unreachable            --- 10.128.2.194 ping statistics ---      5 packets transmitted, 0 received, +5 errors, 100% packet loss, time 4107ms      pipe 3      ]      {

      Expected results:

      ping -I red -c5 10.128.2.194 should be successful on both Intel X710 and MLX Connect-X5 card

      Additional info:

      The test also fails with 4.13.11

            [OCPBUGS-22373] VRF OCP Primary network overlap traffic failure on Intel X710 SRIOV interface

            Since the problem described in this issue should be resolved in a recent advisory, it has been closed.

            For information on the advisory (OpenShift Container Platform 4.13.27 security and extras update), and where to find the updated files, follow the link below.

            If the solution does not work for you, open a new bug report.
            https://access.redhat.com/errata/RHBA-2023:7826

            Errata Tool added a comment - Since the problem described in this issue should be resolved in a recent advisory, it has been closed. For information on the advisory (OpenShift Container Platform 4.13.27 security and extras update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2023:7826

            Looks like this bug is far enough along in the workflow that a code fix is ready. Customers and support need to know the backport plan. Please complete the "Target Backport Versions" field to indicate which version(s) will receive the fix.

            OpenShift Jira Bot added a comment - Looks like this bug is far enough along in the workflow that a code fix is ready. Customers and support need to know the backport plan. Please complete the " Target Backport Versions " field to indicate which version(s) will receive the fix.

            QE Validation

            OCP 4.13.27 

            Ran VRF regression test cases. All test cases passed

            Bux fix validated

            Gregory Kopels added a comment - QE Validation OCP 4.13.27  Ran VRF regression test cases. All test cases passed Bux fix validated

            The VRF test cases failed on our last zsstream test run. OCP 4.13.23 because a related bug effecting Intel SRIOV interfaces 710 and 810 Intel bug https://issues.redhat.com/browse/RHEL-7168

            There is a bug fix that was verified on 4.14 and has just now been backported to 4.13. At the next 4.13 zstream test run we can verify this bug. 
            PR to backport the fix to 4.13 https://github.com/openshift/sriov-cni/pull/93

            Gregory Kopels added a comment - The VRF test cases failed on our last zsstream test run. OCP 4.13.23 because a related bug effecting Intel SRIOV interfaces 710 and 810 Intel bug https://issues.redhat.com/browse/RHEL-7168 There is a bug fix that was verified on 4.14 and has just now been backported to 4.13. At the next 4.13 zstream test run we can verify this bug.  PR to backport the fix to 4.13 https://github.com/openshift/sriov-cni/pull/93

            Hi apanatto@redhat.com,

            Bugs should not be moved to Verified without first providing a Release Note Type("Bug Fix" or "No Doc Update") and for type "Bug Fix" the Release Note Text must also be provided. Please populate the necessary fields before moving the Bug to Verified.

            OpenShift Jira Bot added a comment - Hi apanatto@redhat.com , Bugs should not be moved to Verified without first providing a Release Note Type("Bug Fix" or "No Doc Update") and for type "Bug Fix" the Release Note Text must also be provided. Please populate the necessary fields before moving the Bug to Verified.

              apanatto@redhat.com Andrea Panattoni
              openshift-crt-jira-prow OpenShift Prow Bot
              Gregory Kopels Gregory Kopels
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: