Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-20209

[Multi-NIC]EgressIP was not moved to second egress node after first egress node unavailable

XMLWordPrintable

    • Moderate
    • No
    • SDN Sprint 243, SDN Sprint 244, SDN Sprint 245, SDN Sprint 246, SDN Sprint 247, SDN Sprint 248, SDN Sprint 249
    • 7
    • Rejected
    • False
    • Hide

      None

      Show
      None
    • Hide
      * Previously, EgressIP pods hosted by a secondary interface would not failover because of a race condition. Users would receive an error message indicating that the EgressIP pod could not be assigned because it conflicted with an existing IP address. With this update, the EgressIP pod moves to an egress node. (https://issues.redhat.com/browse/OCPBUGS-20209[(*OCPBUGS-20209*])
      Show
      * Previously, EgressIP pods hosted by a secondary interface would not failover because of a race condition. Users would receive an error message indicating that the EgressIP pod could not be assigned because it conflicted with an existing IP address. With this update, the EgressIP pod moves to an egress node. ( https://issues.redhat.com/browse/OCPBUGS-20209 [(* OCPBUGS-20209 *])
    • Bug Fix
    • Done

      Description of problem:

      Not able to reproduce it manually, but frequently happens when run auto scripts.
      
      

      Version-Release number of selected component (if applicable):

      4.14.0-0.nightly-2023-10-05-195247
      
      

      How reproducible:

      
      

      Steps to Reproduce:

      1. Label worker-0 node as egress node, created egressIP object,the egressIP was assigned to worker-0 node successfully on secondary NIC
      
      2. Block 9107 port on  worker-0 node and label worker-1 as egress node
      
      3.
      

      Actual results:

      EgressIP was not moved to second node
       % oc get egressip
      NAME             EGRESSIPS      ASSIGNED NODE   ASSIGNED EGRESSIPS
      egressip-66330   172.22.0.196
      
      
      40m         Warning   EgressIPConflict          egressip/egressip-66330       Egress IP egressip-66330 with IP 172.22.0.196 is conflicting with a host (worker-0) IP address and will not be assigned
      sh-4.4# ip a show enp1s0
      2: enp1s0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP group default qlen 1000
          link/ether 00:1c:cf:40:5d:25 brd ff:ff:ff:ff:ff:ff
          inet 172.22.0.109/24 brd 172.22.0.255 scope global dynamic noprefixroute enp1s0
             valid_lft 76sec preferred_lft 76sec
          inet6 fe80::21c:cfff:fe40:5d25/64 scope link noprefixroute 
             valid_lft forever preferred_lft forever
      
      

      Expected results:

      EgressIP should move to second egress node
      
      

      Additional info:

      Workaround: deleted it and recreated it works
      % oc get egressip
      NAME             EGRESSIPS      ASSIGNED NODE   ASSIGNED EGRESSIPS
      egressip-66330   172.22.0.196                   
      % oc delete egressip --all
      egressip.k8s.ovn.org "egressip-66330" deleted
       % oc create -f ../data/egressip/config1.yaml 
      egressip.k8s.ovn.org/egressip-3 created
      % oc get egressip
      NAME         EGRESSIPS      ASSIGNED NODE   ASSIGNED EGRESSIPS
      egressip-3   172.22.0.196   worker-1        172.22.0.196
      
      

              mkennell@redhat.com Martin Kennelly
              huirwang Huiran Wang
              Huiran Wang Huiran Wang
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated:
                Resolved: