Uploaded image for project: 'AMQ Interconnect'
  1. AMQ Interconnect
  2. ENTMQIC-3394

Some Links Fail to Reestablish Between Hosted Routers in OpenShift and External Routers After a Multiple Connection Loss (rhel7 image build)

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done-Errata
    • Icon: Major Major
    • None
    • 1.10.4.GA
    • Qpid Dispatch Router
    • None
    • False
    • None
    • False
    • Critical
    • Customer Facing

      In a topology consisting of a router network hosted outside of OpenShift (two pairs of client-facing routers, linked to two pairs of outward-facing routers) and a router pair hosted in OpenShift, we had a recent incident where the connection was lost between the outward-facing on-prem routers and the hosted routers (Link to Neighbor Router Lost). Most of the links were immediately reestablished, but one link failed to reestablish between the on-prem and hosted routers. We can see that the link was reattached on the client-facing and outward-facing on-prem routers, but the link never got recreated on the hosted router until the application was restarted 3 hours later.

      From the analysis following the incident, this appears to have been cause by a relatively unique set of circumstances in which a primary connection failure that resulted in the initial link losses was followed by a secondary connection failure in the network, resulting in a race condition while reestablishing / attaching the links.

              gmurthy@redhat.com Ganesh Murthy
              rhn-support-dhawkins Duane Hawkins
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: