Uploaded image for project: 'Fast Datapath Product'
  1. Fast Datapath Product
  2. FDP-1567

Many Mac_Binding related transaction errors when using IPv6

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • None
    • ovn24.03
    • None
    • 8
    • False
    • Hide

      None

      Show
      None
    • False
    • Hide

      Given an OVN deployment with an HA DGP and multiple chassis and IPv6 ND_NA packets are received on a logical router port,

      When the MAC_Binding table is updated in response to ND_NA,

      Then only the active HA chassis must attempt the update and all standby chassis must skip the transaction.

      Show
      Given an OVN deployment with an HA DGP and multiple chassis and IPv6 ND_NA packets are received on a logical router port, When the MAC_Binding table is updated in response to ND_NA, Then only the active HA chassis must attempt the update and all standby chassis must skip the transaction.
    • ovn24.03-24.03.6-48.el9fdp
    • rhel-9
    • None
    • rhel-net-ovn
    • ssg_networking
    • OVN FDP Sprint 8, OVN FDP Sprint 9
    • 2

       Problem Description: Clearly explain the issue.

      When using Distributed Gateway Port and HA, IPv6 ND_NA packet
      received on the router port causes all HA chassis to try updating the
      Mac_Binding table, often causing transaction errors.

      Reception of ARP packets do not cause the same issue.

       Impact Assessment: Describe the severity and impact (e.g., network down,availability of a workaround, etc.).

      Transaction errors causes full recomputes, hence high CPU utilization.

       Software Versions: Specify the exact versions in use (e.g.,openvswitch3.1-3.1.0-147.el8fdp).

      Reproduced on main.

        Issue Type: Indicate whether this is a new issue or a regression (if a regression, state the last known working version).

      Not a regression.

       Reproducibility: Confirm if the issue can be reproduced consistently. If not, describe how often it occurs.

      Not all ND_NA received on the router ports cause a transaction error (thanks to MAX_MAC_BINDING_DELAY_MSEC). 

      However, sending 50 ND_NA packets to 3 HA chassis I see the problems most of the times.

       Reproduction Steps: Provide detailed steps or scripts to replicate the issue.

      Send many ND_NA from outside to the router.

       Expected Behavior: Describe what should happen under normal circumstances.

      Only the active HA chassis should update sb, hence no transaction errors.

       Observed Behavior: Explain what actually happens.

      Reception of ND_NA packets causes all HA chassis to try to update the same mac_binding in sb, hence increasing the risk of transaction failures.

       Troubleshooting Actions: Outline the steps taken to diagnose or resolve the issue so far.

       Logs: If you collected logs please provide them (e.g. sos report, /var/log/openvswitch/* , testpmd console)

              xsimonar@redhat.com Xavier Simonart
              xsimonar@redhat.com Xavier Simonart
              Jianlin Shi Jianlin Shi
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: