-
Bug
-
Resolution: Unresolved
-
Major
-
None
-
4.14
-
Important
-
No
-
SDN Sprint 253, SDN Sprint 254, SDN Sprint 255, SDN Sprint 256, SDN Sprint 257, SDN Sprint 258, SDN Sprint 259, SDN Sprint 260, SDN Sprint 261, SDN Sprint 262, SDN Sprint 263, SDN Sprint 264, SDN Sprint 265, SDN Sprint 266
-
14
-
False
-
-
-
-
-
Description of problem:
Under some circumstances (not clear exactly which ones), the OVN databases of 2 nodes ended up having 2 src-ip static routes in ovn_cluster_router instead of one: one of them points to the correct IP of the rtoj-GR_${NODE_NAME} LRP and one points to a wrong IP on the join subnet (that IP is not used in any other LRP or LSP).
Both static routes are taken into consideration while routing traffic out from the cluster, so packets that use the right route are able to egress while the packets that use the wrong route are dropped.
Version-Release number of selected component (if applicable):
Reproduced in 4.14.20
How reproducible:
At least once. Only 2 nodes of the cluster.
Steps to Reproduce:
(Not sure, it was just found after investigation of strange packet drop)
Actual results:
Wrong static route to some non-existent IP in the join subnet. Intermittent packet drop.
Expected results:
No wrong static routes. No packet drop.
Additional info:
This can be workarounded by wiping the OVN databases of the impacted node.
- is triggering
-
CORENET-968 Corrective Measure for OCPBUGS-32754: Stale routes to the join switch subnet cause intermittent drops during egress
-
- To Do
-
- links to
-
RHEA-2024:11038 OpenShift Container Platform 4.19.z bug fix update