OCPBUGS-41332 (OpenShift Bugs)

Some worker nodes fail to pull their Ignition config during an OCP 4.16 IPI installation on OpenStack


    • Critical
    • None
    • ShiftStack Sprint 259
    • 1
    • False

      Team,

      The customer is installing a cluster with 3 masters and 3 workers, using floating IPs. Only 1 worker joins the cluster; the other 2 workers fail to pull their Ignition config.

      ~~~
      oc get nodes
      NAME                       STATUS   ROLES                  AGE   VERSION
      ocp-kntph-master-0         Ready    control-plane,master   55m   v1.29.7+4510e9c
      ocp-kntph-master-1         Ready    control-plane,master   56m   v1.29.7+4510e9c
      ocp-kntph-master-2         Ready    control-plane,master   56m   v1.29.7+4510e9c
      ocp-kntph-worker-0-7lclr   Ready    worker                 24m   v1.29.7+4510e9c
      ~~~

       

      Console logs of a failed worker node:

      [76168.418038] ignition[1019]: GET error: Get "https://10.0.0.5:22623/config/worker": net/http: TLS handshake timeout
      [76173.422184] ignition[1019]: GET https://10.0.0.5:22623/config/worker: attempt #5090
      [76183.428258] ignition[1019]: GET error: Get "https://10.0.0.5:22623/config/worker": net/http: TLS handshake timeout
      [76188.428416] ignition[1019]: GET https://10.0.0.5:22623/config/worker: attempt #5091
      [76198.429415] ignition[1019]: GET error: Get "https://10.0.0.5:22623/config/worker": net/http: TLS handshake timeout
      [76203.433545] ignition[1019]: GET https://10.0.0.5:22623/config/worker: attempt #5092
      [76213.439552] ignition[1019]: GET error: Get "https://10.0.0.5:22623/config/worker": net/http: TLS handshake timeout
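
In the excerpt above, consecutive attempts are about 15 seconds apart, and 5092 attempts at that spacing come to roughly 21 hours, which is close to the 76,000-second console timestamp. That suggests the node has been retrying more or less since boot. A minimal sketch for summarising such a console log follows; the function name `summarize_ignition_log` is hypothetical, and it assumes the `[<seconds>] ignition[<pid>]: ...` line format shown above with no spaces inside the timestamp brackets.

```shell
#!/usr/bin/env bash
# Summarise Ignition fetch retries from a console log read on stdin.
# Hypothetical helper: assumes lines shaped like
#   [76173.422184] ignition[1019]: GET <url>: attempt #5090
#   [76168.418038] ignition[1019]: GET error: ...
summarize_ignition_log() {
  awk '
    # Count every failed fetch.
    /ignition\[[0-9]+\]: GET error:/ { errors++ }

    # Track the first and last numbered attempt and their timestamps.
    /ignition\[[0-9]+\]: GET.*attempt #[0-9]+/ {
      ts = $1; gsub(/^\[|\]$/, "", ts); ts = ts + 0   # "[76173.422184]" -> 76173.422184
      n = $NF; sub(/^#/, "", n); n = n + 0            # "#5090" -> 5090
      if (!seen) { first_ts = ts; first_n = n; seen = 1 }
      last_ts = ts; last_n = n
    }

    END {
      printf "errors=%d last_attempt=%d", errors, last_n
      # Average spacing between attempts, if at least two were seen.
      if (last_n > first_n)
        printf " interval=%.0fs", (last_ts - first_ts) / (last_n - first_n)
      printf "\n"
    }'
}
```

With a saved console log (the file name here is only an example), it could be run as `summarize_ignition_log < console.log`.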

       

      When I ran curl against the same URL from the Ready worker, the connection was refused. The same request also fails from all master nodes.

      ~~~
      sh-5.1# curl -k https://10.0.0.5:22623/config/worker | jq .
        % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                       Dload  Upload   Total   Spent    Left  Speed
        0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
      curl: (7) Failed to connect to 10.0.0.5 port 22623: Connection refused
      ~~~
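
The two symptoms are not the same failure: the failed worker reports a TLS handshake timeout (the TCP connection opened but the handshake never completed), while curl from the Ready nodes reports connection refused (exit code 7, no TCP connection at all). When repeating the check from several nodes, it can help to record which class each probe falls into. The sketch below assumes curl's documented exit codes (7 connect failure, 28 timeout, 35 TLS connect error); the function names `classify_curl_exit` and `probe_endpoint` and the timeout values are hypothetical choices, not part of the original report.

```shell
#!/usr/bin/env bash
# Map a curl exit code to a coarse failure class.
# Hypothetical helper; the codes follow curl's documented exit codes.
classify_curl_exit() {
  case "$1" in
    0)  echo "ok" ;;
    7)  echo "connect-failed" ;;        # e.g. "Connection refused", as in the output above
    28) echo "timeout" ;;               # connect or transfer exceeded the time limit
    35) echo "tls-handshake-failed" ;;  # TCP connected, TLS negotiation failed
    *)  echo "other($1)" ;;
  esac
}

# Probe one URL with bounded timeouts and print its failure class.
# -k mirrors the curl call in the report; the timeout values are arbitrary.
probe_endpoint() {
  local url="$1" rc=0
  curl -k -s -o /dev/null --connect-timeout 5 --max-time 15 "$url" || rc=$?
  echo "$url: $(classify_curl_exit "$rc")"
}
```

For example, `probe_endpoint https://10.0.0.5:22623/config/worker` run on each master and on the Ready worker would show whether every node sees the same class of failure.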

              Martin André (maandre@redhat.com)
              Chandan Chouhan (rhn-support-cchouhan)
              Itshak Brown