Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-56615

Entering malformed SSH key to install-config.yaml causes installation to hang for one hour and finally fails with no clear error message

XMLWordPrintable

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Moderate
    • No
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

          As part of testing EPIC https://issues.redhat.com/browse/ACM-15286, I entered a malformed SSH key to install-config.yaml file and ran the installation. The installer hanged for one hour until it finally failed without a clear error message:

       

      time="2025-05-17T12:03:46-04:00" level=info msg="  baremetalhost: openshift-master-0-0: uninitialized"
      time="2025-05-17T12:03:47-04:00" level=info msg="  baremetalhost: openshift-master-0-1: uninitialized"
      time="2025-05-17T12:03:47-04:00" level=info msg="  baremetalhost: openshift-master-0-2: uninitialized"
      time="2025-05-17T13:02:15-04:00" level=error msg="Bootstrap failed to complete: failed to provision control-plane machines: timed out waiting for the condition"
      time="2025-05-17T13:02:15-04:00" level=error msg="Failed to wait for bootstrapping to complete. This error usually happens when there is a problem with control plane hosts that prevents the control plane operators from creating the control plane."
      time="2025-05-17T13:02:15-04:00" level=debug msg="Fetching Bootstrap SSH Key Pair..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="Loading Bootstrap SSH Key Pair..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="Using Bootstrap SSH Key Pair loaded from state file"
      time="2025-05-17T13:02:15-04:00" level=debug msg="Reusing previously-fetched Bootstrap SSH Key Pair"
      time="2025-05-17T13:02:15-04:00" level=debug msg="Fetching Install Config..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="Loading Install Config..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="  Loading SSH Key..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="  Loading Base Domain..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="    Loading Platform..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="  Loading Cluster Name..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="    Loading Base Domain..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="    Loading Platform..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="  Loading Pull Secret..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="  Loading Platform..."
      time="2025-05-17T13:02:15-04:00" level=debug msg="Using Install Config loaded from state file"
      time="2025-05-17T13:02:15-04:00" level=debug msg="Reusing previously-fetched Install Config"
      time="2025-05-17T13:02:15-04:00" level=debug msg="baremetal: getting master addresses"
      time="2025-05-17T13:02:15-04:00" level=debug msg="  bmh:openshift-master-0-0"
      time="2025-05-17T13:02:15-04:00" level=debug msg="    HardwareDetails nil, skipping"
      time="2025-05-17T13:02:15-04:00" level=debug msg="  bmh:openshift-master-0-1"
      time="2025-05-17T13:02:15-04:00" level=debug msg="    HardwareDetails nil, skipping"
      time="2025-05-17T13:02:15-04:00" level=debug msg="  bmh:openshift-master-0-2"
      time="2025-05-17T13:02:15-04:00" level=debug msg="    HardwareDetails nil, skipping"
      time="2025-05-17T13:02:15-04:00" level=error msg="Attempted to gather debug logs after installation failure: must provide bootstrap host address" 

       

       

      Version-Release number of selected component (if applicable):

      OpenShift Installer 4.19.0-0.nightly-2025-05-15-095920    

      How reproducible:

      1/1    

      Steps to Reproduce:

          1. Create 3 SSH key pairs (key#1-4) for kni user
          2. Edit ~/install-config.yaml and insert to it public keys:
       key#1, malformed key, and key#3 as described here:
       https://github.com/stolostron/rhacm-docs/blob/80caff6e57e8ac32cf3bcccad7d4af718899055c/clusters/install_upgrade/install_disconnected.adoc      
          3. Run: /openshift-install --dir clusterconfigs --log-level debug create cluster    

      Actual results:

          Installer hangs for 1 hour and finally fails without clear error message

      Expected results:

      Installer validates the install-config.yaml file and return a corresponding error message right on the start of the installation    

      Additional info:

          

              Unassigned Unassigned
              rh-ee-ugreener Udi Greener
              None
              None
              Gaoyun Pei Gaoyun Pei
              None
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: