Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-35372

Installer sometimes fails to attach the bootstrap FIP when additional networks are set

XMLWordPrintable

    • +
    • Important
    • Yes
    • ShiftStack Sprint 255
    • 1
    • Rejected
    • False
    • Hide

      None

      Show
      None
    • Release Note Not Required
    • In Progress

      This is a clone of issue OCPBUGS-34005. The following is the description of the original issue:

      Description of problem:

      Intermittent error during the installation process when enabling Cluster API (CAPI) in the install-config for OCP 4.16 tech preview IPI installation on top of OSP. The error occurs during the post-machine creation hook, specifically related to Floating IP association.

      Version-Release number of selected component (if applicable):

      OCP: 4.16.0-0.nightly-2024-05-16-092402 TP enabled
      on top of
      OSP: RHOS-17.1-RHEL-9-20240123.n.1

      How reproducible:

      The issue occurs intermittently, sometimes the installation succeeds, and other times it fails.

      Steps to Reproduce:

          1.Install OSP
          2.Initiate OCP installation with TP and CAPI enabled
          3.Observe the installation logs of the failed installation.     

      Actual results:

          The installation fails intermittently with the following error message:
      ...
      2024-05-17 23:37:51.590 | level=debug msg=E0517 23:37:29.833599  266622 controller.go:329] "Reconciler error" err="failed to create cluster accessor: error creating http client and mapper for remote cluster \"openshift-cluster-api-guests/ostest-4qrz2\": error creating client for remote cluster \"openshift-cluster-api-guests/ostest-4qrz2\": error getting rest mapping: failed to get API group resources: unable to retrieve the complete list of server APIs: v1: Get \"https://api.ostest.shiftstack.com:6443/api/v1?timeout=10s\": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)" controller="machine" controllerGroup="cluster.x-k8s.io" controllerKind="Machine" Machine="openshift-cluster-api-guests/ostest-4qrz2-master-0" namespace="openshift-cluster-api-guests" name="ostest-4qrz2-master-0" reconcileID="985ba50c-2a1d-41f6-b494-f5af7dca2e7b"
      2024-05-17 23:37:51.597 | level=debug msg=E0517 23:37:39.838706  266622 controller.go:329] "Reconciler error" err="failed to create cluster accessor: error creating http client and mapper for remote cluster \"openshift-cluster-api-guests/ostest-4qrz2\": error creating client for remote cluster \"openshift-cluster-api-guests/ostest-4qrz2\": error getting rest mapping: failed to get API group resources: unable to retrieve the complete list of server APIs: v1: Get \"https://api.ostest.shiftstack.com:6443/api/v1?timeout=10s\": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)" controller="machine" controllerGroup="cluster.x-k8s.io" controllerKind="Machine" Machine="openshift-cluster-api-guests/ostest-4qrz2-master-0" namespace="openshift-cluster-api-guests" name="ostest-4qrz2-master-0" reconcileID="dfe5f138-ac8e-4790-948f-72d6c8631f21"
      2024-05-17 23:37:51.603 | level=debug msg=Machine ostest-4qrz2-master-0 is ready. Phase: Provisioned
      2024-05-17 23:37:51.610 | level=debug msg=Machine ostest-4qrz2-master-1 is ready. Phase: Provisioned
      2024-05-17 23:37:51.615 | level=debug msg=Machine ostest-4qrz2-master-2 is ready. Phase: Provisioned
      2024-05-17 23:37:51.619 | level=info msg=Control-plane machines are ready
      2024-05-17 23:37:51.623 | level=error msg=failed to fetch Cluster: failed to generate asset "Cluster": failed to create cluster: failed during post-machine creation hook: Resource not found: [POST https://10.46.44.159:13696/v2.0/floatingips], error message: {"NeutronError": {"type": "ExternalGatewayForFloatingIPNotFound", "message": "External network 654792e9-dead-485a-beec-f3c428ef71da is not reachable from subnet d9829374-f0de-4a41-a1c0-a2acdd4841da.  Therefore, cannot associate Port 01c518a9-5d5f-42d8-a090-6e3151e8af3f with a Floating IP.", "detail": ""}}
      2024-05-17 23:37:51.629 | level=info msg=Shutting down local Cluster API control plane...
      2024-05-17 23:37:51.637 | level=info msg=Stopped controller: Cluster API
      2024-05-17 23:37:51.643 | level=warning msg=process cluster-api-provider-openstack exited with error: signal: killed
      2024-05-17 23:37:51.653 | level=info msg=Stopped controller: openstack infrastructure provider
      2024-05-17 23:37:51.659 | level=info msg=Local Cluster API system has completed operations
      
      

      Expected results:

      The installation should complete successfully

      Additional info: CAPI is enabled by adding the following to the install-config: 

      featureSet: 'CustomNoUpgrade'
      featureGates: ['ClusterAPIInstall=true']

            pprinett@redhat.com Pierre Prinetti
            openshift-crt-jira-prow OpenShift Prow Bot
            Itay Matza Itay Matza
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

              Created:
              Updated:
              Resolved: