Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-20290

resolv-prepender performing a podman pull can cause timeout activating connections.

XMLWordPrintable

    • Important
    • No
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      resolv-prepender performing a podman pull can cause timeout activating connections when network is down. 

      Version-Release number of selected component (if applicable):

      4.12

      How reproducible:

      Randomly on reboots

      Steps to Reproduce:

      1. Baremetal with bonding configured
      2. Reboot
      
      

      Actual results:

      configure-ovs.sh fails to configure interfaces for ovn

      Expected results:

      interfaces for ovn get configured

      Additional info:

      We are rebooting the baremetal host and randomly configure-ovs.sh fails to complete. 
      
      We have pinpointed this to the resolv-prepender dispatacher script running multiple times, before the configure.ovs.sh finishes. This results in a 90s timeout trying to activate *-slave-ovs-clone interface. This results in the configure-ovs.sh script to ultimately fail. 
      
      A `podman pull` requires network connectivity, the script should not be dependent on this succeeding: 
      
      https://github.com/openshift/machine-config-operator/blob/release-4.12/templates/common/on-prem/files/resolv-prepender.yaml#L22
      
      Instead we should either try 5 times then move on to running the rest of the script as the image is likely already local. 
      
      
      

              bnemec@redhat.com Benjamin Nemec
              rhn-support-rhowe Ryan Howe
              Sergio Regidor de la Rosa Sergio Regidor de la Rosa
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: