OpenShift Bugs / OCPBUGS-42525

ABI Installation failing for compact and HA clusters in vSphere environment

    • Critical
    • None
    • Installer Sprint 260, Installer Sprint 261
    • 2
    • Proposed
    • False
    • Cause: The agent-based installer (ABI) set the assisted-service logging mode to debug. Setting debug had the unintended consequence of also turning on pprof for assisted-service, which listens on port 6060.

      Consequence: In ABI, the cloud-credential-operator does not start because it also runs pprof on port 6060 and finds the port already in use. Because the cloud-credential-operator does not run, secrets for vSphere are not generated when requested by the vSphere cloud-controller-manager (CCM). The CCM cannot initialize the nodes, which blocks the cluster installation.

      Fix: assisted-service does not run pprof if the invoker is the agent-installer.

      Result: The cloud-credential-operator is able to run, and cluster installations on vSphere using the agent-based installer are able to succeed.
    • Bug Fix
    • In Progress

      Description of problem:

      The installation of compact and HA clusters is failing in the vSphere environment. During the cluster setup, two master nodes were observed to be in a "Not Ready" state, and the rendezvous host failed to join the cluster. 

      Version-Release number of selected component (if applicable):

      4.17.0-0.nightly-2024-09-25-131159    

      How reproducible:

      100%    

      Actual results:

      level=info msg=Cluster operator cloud-controller-manager TrustedCABundleControllerControllerAvailable is True with AsExpected: Trusted CA Bundle Controller works as expected
      level=info msg=Cluster operator cloud-controller-manager TrustedCABundleControllerControllerDegraded is False with AsExpected: Trusted CA Bundle Controller works as expected
      level=info msg=Cluster operator cloud-controller-manager CloudConfigControllerAvailable is True with AsExpected: Cloud Config Controller works as expected
      level=info msg=Cluster operator cloud-controller-manager CloudConfigControllerDegraded is False with AsExpected: Cloud Config Controller works as expected
      level=info msg=Use the following commands to gather logs from the cluster
      level=info msg=openshift-install gather bootstrap --help
      level=error msg=Bootstrap failed to complete: : bootstrap process timed out: context deadline exceeded
      ERROR: Bootstrap failed. Aborting execution.

      Expected results:

      Installation should be successful.    

      Additional info:

      Agent Gather: https://gcsweb-qe-private-deck-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/qe-private-deck/pr-logs/pull/openshift_release/54459/rehearse-54459-periodic-ci-openshift-openshift-tests-private-release-4.17-amd64-nightly-vsphere-agent-compact-fips-f14/1839389511629410304/artifacts/vsphere-agent-compact-fips-f14/cucushift-agent-gather/artifacts/agent-gather.tar.xz
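
The Cause and Fix notes in this issue describe a plain TCP port conflict and a gate on the invoker. The following is a minimal Python sketch of both ideas, not the actual assisted-service or cloud-credential-operator code (those are separate projects and this is only an illustration). The helper names `try_listen` and `should_start_pprof` are hypothetical, and the assumption that the invoker is compared as the literal string "agent-installer" is taken only from the wording of the Fix note.

```python
import socket


def try_listen(port, host="127.0.0.1"):
    """Return a listening socket bound to (host, port), or None if the bind fails.

    A second process binding a port that is already in use gets an OSError,
    which is the kind of failure the Consequence note describes for port 6060.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((host, port))
        sock.listen(1)
        return sock
    except OSError:
        sock.close()
        return None


def should_start_pprof(log_level, invoker):
    """Hypothetical gate mirroring the described fix.

    Debug logging alone no longer enables the profiler when the
    agent-based installer is the invoker.
    """
    return log_level == "debug" and invoker != "agent-installer"


if __name__ == "__main__":
    # Use an ephemeral port (0) so the demo does not depend on 6060 being free.
    first = try_listen(0)
    port = first.getsockname()[1]

    # A second listener on the same port fails, as with the two pprof servers.
    second = try_listen(port)
    print("first listener bound:", first is not None)
    print("second listener bound:", second is not None)

    # With the gate in place, the first listener is never started under ABI.
    print("start pprof under ABI:", should_start_pprof("debug", "agent-installer"))
    first.close()
```

The sketch binds an ephemeral port rather than 6060 so that it runs on any machine; in the reported failure both services used port 6060 on the same host.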

              bfournie@redhat.com Robert Fournier
              rhn-support-mhans Manoj Hans
              Manoj Hans Manoj Hans