Fast Datapath Product / FDP-2851

Spike: ovsdb-server stopped serving connections without logging errors

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Major
    • openvswitch3.3
      Please mark each item below with ( / ) if completed or ( x ) if incomplete:
      ( ) Unit or integration test cases are written and pass successfully
      ( ) The upstream pull request is merged and passes CI

    • rhel-9
    • rhel-net-ovs-dpdk

      This is tracking the upstream effort needed to deliver the solution to the bug described below.


       Problem Description:

      In an RHOSP 17.1 ML2/OVN environment, workloads lost network connectivity. The customer collected a sosreport from the affected compute node and then restored connectivity by restarting ovsdb-server.service.

      In the collected sosreport:

      • there are no errors in /var/log/openvswitch/ovsdb-server.log, although the ps output shows that the ovsdb-server process is running
      • there are recurring errors in /var/log/containers/openvswitch/ovn-controller.log reporting that the connection to unix:/var/run/openvswitch/br-int.mgmt failed with a protocol error
      • the sosreport tool itself timed out while collecting several OVS command outputs, which confirms that a general problem existed on the node
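      The symptoms above (process alive, nothing logged, clients timing out) could be checked for with a small liveness probe. The following is only a sketch: it sends an OVSDB JSON-RPC "echo" request (RFC 7047) over a Unix socket and reports whether a reply arrives within a timeout. The function name probe_ovsdb and the result strings are hypothetical, and the socket path in the usage note is an assumed default that does not appear in this issue.

```python
import json
import socket


def probe_ovsdb(sock_path: str, timeout: float = 5.0) -> str:
    """Hypothetical helper: return 'ok', 'timeout', 'unreachable' or 'bad-reply'."""
    # RFC 7047 "echo": the server is expected to return the params unchanged.
    request = {"method": "echo", "params": [], "id": "probe"}
    try:
        with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as sock:
            sock.settimeout(timeout)
            sock.connect(sock_path)
            sock.sendall(json.dumps(request).encode())
            # An echo reply is small, so a single recv is assumed to be enough.
            data = sock.recv(4096)
    except socket.timeout:
        # The socket accepted the connection but nothing answered in time:
        # this matches "process running but not serving connections".
        return "timeout"
    except OSError:
        # Socket file missing, connection refused, permission denied, etc.
        return "unreachable"
    try:
        reply = json.loads(data.decode())
    except ValueError:
        return "bad-reply"
    if isinstance(reply, dict) and reply.get("id") == "probe" and reply.get("error") is None:
        return "ok"
    return "bad-reply"
```

      For example, probe_ovsdb("/var/run/openvswitch/db.sock", timeout=5.0) could be run periodically on a compute node (the path is an assumption). A "timeout" result while ps still lists the process would point to the condition described in this issue.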

      Example error from /var/log/containers/openvswitch/ovn-controller.log:

      2025-11-27T11:57:11.004Z|00354|rconn|WARN|unix:/var/run/openvswitch/br-int.mgmt: connection failed (Protocol error)
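      To quantify how often this warning recurs in a log, lines of this shape can be parsed and counted. The following is a minimal sketch that assumes every line follows the pipe-separated layout of the example above (timestamp|sequence|module|level|message). The names ConnFailure, parse_conn_failure and count_failures are hypothetical.

```python
import re
from collections import Counter
from typing import Iterable, NamedTuple, Optional

# Layout assumed from the example line: timestamp|sequence|module|level|message
LOG_LINE = re.compile(
    r"^(?P<ts>[^|]+)\|(?P<seq>\d+)\|(?P<module>[^|]+)\|(?P<level>[A-Z]+)\|(?P<msg>.*)$"
)
# Message shape seen in the example: "<target>: connection failed (<reason>)"
CONN_FAILED = re.compile(r"^(?P<target>\S+): connection failed \((?P<reason>[^)]*)\)")


class ConnFailure(NamedTuple):
    timestamp: str
    target: str
    reason: str


def parse_conn_failure(line: str) -> Optional[ConnFailure]:
    """Return the failure described by one log line, or None if it is not one."""
    m = LOG_LINE.match(line.strip())
    if not m or m["module"] != "rconn" or m["level"] != "WARN":
        return None
    failed = CONN_FAILED.match(m["msg"])
    if not failed:
        return None
    return ConnFailure(m["ts"], failed["target"], failed["reason"])


def count_failures(lines: Iterable[str]) -> Counter:
    """Count rconn connection failures per (target, reason) pair."""
    counts: Counter = Counter()
    for line in lines:
        failure = parse_conn_failure(line)
        if failure:
            counts[(failure.target, failure.reason)] += 1
    return counts
```

      Passing an open file object for ovn-controller.log to count_failures gives the number of failures per target and reason, which helps show when the failures started and whether they stopped after the restart.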

       

       Impact Assessment: network connectivity was completely lost while the problem was present; the problem is resolved now

       

       Software Versions: openvswitch3.3-3.3.6-141.el9fdp.x86_64

       

        Issue Type: bug

       

       Reproducibility: no reproducer

       

       Troubleshooting Actions: explained in description

       

       Logs: a single sosreport from the compute node running the affected ovsdb-server is attached to the support case linked to this issue


              ovsdpdk-triage ovsdpdk triage
              nstbot NST Bot