Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-51382

Agent error "installation command failed"

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Major Major
    • None
    • 4.17, 4.18
    • HyperShift / Agent
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • Yes
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      Hypershift deployment fails for agents failed installation:
      [kni@ocp-edge77 ~]$ oc get agent -n hosted-0
      NAME                                   CLUSTER    APPROVED   ROLE     STAGE
      020bb6ae-1b3d-4b57-a533-0515b415c73c   hosted-0   true       worker   Done
      5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae   hosted-0   true       worker   
      e4fc255a-cd66-4092-be37-de937277b86f   hosted-0   true       worker   
      e6ee2ceb-80d2-454e-b9dd-160fe5aec1af   hosted-0   true       worker   Done
      ee148577-bac1-41f5-bbe6-381bd358f66f   hosted-0   true       worker   Done
      f5560e24-1b2c-44e5-a9f1-c1e03aaf17ce   hosted-0   true       worker

      The following error appears in failed agent:

      [kni@ocp-edge77 ~]$ oc get agent -n hosted-0 5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae -o json | jq '.status.debugInfo'
      {   
      "eventsURL": "https://assisted-service-multicluster-engine.apps.hub.qe.lab.redhat.com/api/assisted-install/v2/events?api_key=eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9.eyJpbmZyYV9lbnZfaWQiOiI4ODBkOWVmNC1hYjY2LTQyOGQtYTk0Mi1jZWRiZWJhNWQ2NWEifQ.iJkT3wShWdQD48QHKiTlC4E9H4ZAF9ZDNjBZVBwP_vym2xABU-jIU2nxgGTt9SdHpVvqNaNk4SeZnwWI3d1O3g&host_id=5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae",   "logsURL": "https://assisted-service-multicluster-engine.apps.hub.qe.lab.redhat.com/api/assisted-install/v2/clusters/06cd2fff-bc70-4482-baf0-a88a82ce86fe/logs?api_key=eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9.eyJjbHVzdGVyX2lkIjoiMDZjZDJmZmYtYmM3MC00NDgyLWJhZjAtYTg4YTgyY2U4NmZlIn0.40_YZNXhjWZGPnhGOnLb-lnT09XF6oDnHGydeiBVgb87Ar1dklzq0CxBH3yqiq-xbFmzAmSrcTxfYPmnthq1gw&host_id=5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae&logs_type=host",   "state": "error",   "stateInfo": "installation command failed" 
      }
      

      Version-Release number of selected component (if applicable):

      [kni@ocp-edge77 ~]$ oc version
      Client Version: 4.17.17
      Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
      Server Version: 4.17.17
      Kubernetes Version: v1.30.7

      How reproducible:

          3/3

      Steps to Reproduce:

          1. Deployed hub cluster with 3 masters and hypershift cluster with 6 workers (I used this profile: https://jenkins-csb-kniqe-auto.dno.corp.redhat.com/job/CI/job/job-runner/4642/     )
      
      

      Actual results:

         1. Hosted cluster deployment fails due to:
      13:03:04  FAILED - RETRYING: [provisionhost-0-0]: Wait for agents to be done (2 retries left).
      13:03:35  FAILED - RETRYING: [provisionhost-0-0]: Wait for agents to be done (1 retries left).
      13:04:06  fatal: [provisionhost-0-0]: FAILED! => {"attempts": 100, "changed": true, "cmd": "oc get agent -n hosted-0 --no-headers=true | awk '{print $5}' | grep -c Done\n", "delta": "0:00:00.224220", "end": "2025-02-27 13:04:04.665182", "msg": "", "rc": 0, "start": "2025-02-27 13:04:04.440962", "stderr": "", "stderr_lines": [], "stdout": "3", "stdout_lines": ["3"]} 

      Expected results:

          Deployment is successful

      Additional info:

          

              cchun@redhat.com Crystal Chun
              rh-ee-ugreener Udi Greener
              None
              Gal Amado
              Udi Greener Udi Greener
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: