Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-43387

ABI vSphere cluster installation fails due to etcd operator stuck in In-Progress status

XMLWordPrintable

    • Critical
    • None
    • Rejected
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      The ABI cluster installation appeared successful because the agent wait-for install-complete command only checks if all operators are available, but in reality, the etcd operator was still in progress.
      
      I attempted to debug the issue and discovered that the etcd pod on the rendezvous node was in a CrashLoopBackOff state.
      
              message: |
                /bin/sh: line 4: NODE_ci_ln_8hx21zb_c1627_master_0_ETCD_URL_HOST: not set
              reason: Error
              startedAt: "2024-10-15T16:25:07Z"
          name: etcd-ensure-env-vars
          ready: false
          restartCount: 6
          started: false
          state:
            waiting:
              message: back-off 5m0s restarting failed container=etcd-ensure-env-vars pod=etcd-ci-ln-8hx21zb-c1627-master-0_openshift-etcd(4e70d2cb4965143304a7aaf1d7e91a5e)
              reason: CrashLoopBackOff 

      Version-Release number of selected component (if applicable):

      4.18.0-0.nightly-2024-10-15-032107

      How reproducible:

      Always

      Actual results:

      ClusterOperators:664 clusteroperator/etcd is progressing: NodeInstallerProgressing: 1 node is at revision 0; 1 node is at revision 5; 1 node is at revision 6

      Expected results:

      Installation should be successful.  

      Gather-extra: https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/test-platform-results/logs/release-openshift-origin-installer-launch-vsphere-modern/1846111957640286208/artifacts/launch/gather-extra/ 

      Must-gather: https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/test-platform-results/logs/release-openshift-origin-installer-launch-vsphere-modern/1846111957640286208/artifacts/launch/gather-must-gather/

              rwsu1@redhat.com Richard Su
              rhn-support-mhans Manoj Hans
              Manoj Hans Manoj Hans
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: