-
Bug
-
Resolution: Done
-
Major
-
None
-
4.17, 4.18
-
None
-
Quality / Stability / Reliability
-
False
-
-
None
-
None
-
Yes
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
Description of problem:
Hypershift deployment fails for agents failed installation: [kni@ocp-edge77 ~]$ oc get agent -n hosted-0 NAME CLUSTER APPROVED ROLE STAGE 020bb6ae-1b3d-4b57-a533-0515b415c73c hosted-0 true worker Done 5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae hosted-0 true worker e4fc255a-cd66-4092-be37-de937277b86f hosted-0 true worker e6ee2ceb-80d2-454e-b9dd-160fe5aec1af hosted-0 true worker Done ee148577-bac1-41f5-bbe6-381bd358f66f hosted-0 true worker Done f5560e24-1b2c-44e5-a9f1-c1e03aaf17ce hosted-0 true worker
The following error appears in failed agent:
[kni@ocp-edge77 ~]$ oc get agent -n hosted-0 5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae -o json | jq '.status.debugInfo' { "eventsURL": "https://assisted-service-multicluster-engine.apps.hub.qe.lab.redhat.com/api/assisted-install/v2/events?api_key=eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9.eyJpbmZyYV9lbnZfaWQiOiI4ODBkOWVmNC1hYjY2LTQyOGQtYTk0Mi1jZWRiZWJhNWQ2NWEifQ.iJkT3wShWdQD48QHKiTlC4E9H4ZAF9ZDNjBZVBwP_vym2xABU-jIU2nxgGTt9SdHpVvqNaNk4SeZnwWI3d1O3g&host_id=5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae", "logsURL": "https://assisted-service-multicluster-engine.apps.hub.qe.lab.redhat.com/api/assisted-install/v2/clusters/06cd2fff-bc70-4482-baf0-a88a82ce86fe/logs?api_key=eyJhbGciOiJFUzI1NiIsInR5cCI6IkpXVCJ9.eyJjbHVzdGVyX2lkIjoiMDZjZDJmZmYtYmM3MC00NDgyLWJhZjAtYTg4YTgyY2U4NmZlIn0.40_YZNXhjWZGPnhGOnLb-lnT09XF6oDnHGydeiBVgb87Ar1dklzq0CxBH3yqiq-xbFmzAmSrcTxfYPmnthq1gw&host_id=5f04f6e8-d8a2-4c0c-b4ce-1b4f06a2a4ae&logs_type=host", "state": "error", "stateInfo": "installation command failed" }
Version-Release number of selected component (if applicable):
[kni@ocp-edge77 ~]$ oc version Client Version: 4.17.17 Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3 Server Version: 4.17.17 Kubernetes Version: v1.30.7
How reproducible:
3/3
Steps to Reproduce:
1. Deployed hub cluster with 3 masters and hypershift cluster with 6 workers (I used this profile: https://jenkins-csb-kniqe-auto.dno.corp.redhat.com/job/CI/job/job-runner/4642/ )
Actual results:
1. Hosted cluster deployment fails due to: 13:03:04 FAILED - RETRYING: [provisionhost-0-0]: Wait for agents to be done (2 retries left). 13:03:35 FAILED - RETRYING: [provisionhost-0-0]: Wait for agents to be done (1 retries left). 13:04:06 fatal: [provisionhost-0-0]: FAILED! => {"attempts": 100, "changed": true, "cmd": "oc get agent -n hosted-0 --no-headers=true | awk '{print $5}' | grep -c Done\n", "delta": "0:00:00.224220", "end": "2025-02-27 13:04:04.665182", "msg": "", "rc": 0, "start": "2025-02-27 13:04:04.440962", "stderr": "", "stderr_lines": [], "stdout": "3", "stdout_lines": ["3"]}
Expected results:
Deployment is successful
Additional info:
- duplicates
-
OCPBUGS-50655 Assisted Installer Agent image pull timeout is too short
-
- Closed
-