-
Bug
-
Resolution: Done
-
Normal
-
None
-
4.16
-
None
-
None
-
1
-
Metal Platform 264, Metal Platform 265
-
2
-
False
-
Description of problem:
The "openshift-install create cluster" command failed with the following error:
failed to fetch Cluster Infrastructure Variables: failed to fetch dependency of "Cluster Infrastructure Variables": failed to generate asset "Platform Provisioning Check": platform.baremetal.BootstrapOSImage: Not found: "http://192.168.111.1/images/rhcos-416.94.202405291527-0-qemu.x86_64.qcow2.gz?sha256=2f0a1a28aa29995106380846e9e4aefd758e57c5b17a2489137e9dddc8aeb43e"
Version-Release number of selected component (if applicable):
OPENSHIFT_VERSION=4.16.0
How reproducible:
90%
Steps to Reproduce:
1. Use https://github.com/openshift-metal3/dev-scripts.git or https://github.com/openstack-k8s-operators/ci-framework.git to build an OCP environment.
2. Follow the guidance to run the automation.
Actual results:
Both automated installations fail with the same problem when running the command: "openshift-install --dir ocp/ocp --log-level=debug create cluster"
The related log:
+(utils.sh:182): create_cluster(): '[' '!' -z '' ']'
+(utils.sh:194): create_cluster(): trap auth_template_and_removetmp EXIT
+(utils.sh:195): create_cluster(): grep --line-buffered -v 'password\|X-Auth-Token\|UserData:'
+(utils.sh:195): create_cluster(): ocp/ocp/openshift-install --dir ocp/ocp --log-level=debug create cluster
level=debug msg=OpenShift Installer 4.16.0
level=debug msg=Built from commit 0dc3033888f4cae22e5e5897921422f7180c4033
level=debug msg=Fetching Metadata...
level=debug msg=Loading Metadata...
...
level=debug msg= Fetching Platform Provisioning Check...
level=debug msg= Fetching Install Config...
level=debug msg= Reusing previously-fetched Install Config
level=debug msg= Generating Platform Provisioning Check...
level=fatal msg=failed to fetch Cluster Infrastructure Variables: failed to fetch dependency of "Cluster Infrastructure Variables": failed to generate asset "Platform Provisioning Check": platform.baremetal.BootstrapOSImage: Not found: "http://192.168.111.1/images/rhcos-416.94.202405291527-0-qemu.x86_64.qcow2.gz?sha256=2f0a1a28aa29995106380846e9e4aefd758e57c5b17a2489137e9dddc8aeb43e"
+(utils.sh:1): create_cluster(): auth_template_and_removetmp
+(utils.sh:879): auth_template_and_removetmp(): echo 1
+(utils.sh:880): auth_template_and_removetmp(): generate_auth_template
+(utils.sh:327): generate_auth_template(): set +x
error: Missing or incomplete configuration info. Please point to an existing, complete config file:
1. Via the command-line flag --kubeconfig
2. Via the KUBECONFIG environment variable
3. In your home directory as ~/.kube/config
To view or setup config directly use the 'config' command.
make: *** [Makefile:67: ocp_run] Error 1
stdout_lines: <omitted>
Expected results:
The OCP environment can be built successfully
Additional info:
(1) The host I used to build the OCP environment:
hostname: dell-per7525-23.lab.eng.pek2.redhat.com
OS: Red Hat Enterprise Linux 9.4 (Plow)
memory: 128G
CPU(s): 64
disk: 1.1T

# export
declare -x HTTPS_PROXY="squid.redhat.com:3128"
declare -x HTTP_PROXY="squid.redhat.com:3128"

# lshw -c network -businfo
Network interfaces
Bus info          Device   Class    Description
=======================================================
pci@0000:21:00.0  ens2f0   network  NetXtreme II BCM57810 10 Gigabit Ethernet
pci@0000:21:00.1  ens2f1   network  NetXtreme II BCM57810 10 Gigabit Ethernet
pci@0000:e2:00.0  ens7f0   network  82599ES 10-Gigabit SFI/SFP+ Network Connection
pci@0000:e2:00.1  ens7f1   network  82599ES 10-Gigabit SFI/SFP+ Network Connection
pci@0000:e1:00.0  eno8303  network  NetXtreme BCM5720 Gigabit Ethernet PCIe
pci@0000:e1:00.1  eno8403  network  NetXtreme BCM5720 Gigabit Ethernet PCIe
(2)
This issue can currently be reproduced only on my "PEK2" machines, and it occurs on all of them.
When I repeated the same steps on the "RDU2" machine, the deployment finished successfully.
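One way to narrow down the difference between the PEK2 and RDU2 hosts might be to probe the bootstrap image URL from the failing host, once with a direct connection and once with the proxy settings from the environment. The sketch below is only a diagnostic aid under stated assumptions: the function names are hypothetical, the use of a HEAD request is an assumption about what is worth checking (not a statement of how the installer validates the URL), and the proxy is a suspect rather than a confirmed root cause.

```python
import urllib.error
import urllib.request
from urllib.parse import parse_qs, urlsplit

# URL copied from the error message in this report.
IMAGE_URL = (
    "http://192.168.111.1/images/"
    "rhcos-416.94.202405291527-0-qemu.x86_64.qcow2.gz"
    "?sha256=2f0a1a28aa29995106380846e9e4aefd758e57c5b17a2489137e9dddc8aeb43e"
)


def describe_image_url(url):
    """Split the image URL into host, file name, and expected sha256 (hypothetical helper)."""
    parts = urlsplit(url)
    checksum = parse_qs(parts.query).get("sha256", [None])[0]
    return {
        "host": parts.hostname,
        "file": parts.path.rsplit("/", 1)[-1],
        "sha256": checksum,
    }


def head_status(url, proxies=None, timeout=10):
    """Send a HEAD request; return the HTTP status code or an error string.

    proxies={} forces a direct connection. proxies=None lets urllib pick up
    the proxy variables (HTTP_PROXY, NO_PROXY, ...) from the environment.
    """
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(proxies))
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener.open(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server (or a proxy) answered with an HTTP error such as 404.
        return err.code
    except (urllib.error.URLError, OSError) as err:
        # No HTTP answer at all: connection refused, timeout, DNS failure, ...
        return f"error: {err}"


if __name__ == "__main__":
    print(describe_image_url(IMAGE_URL))
    print("direct:         ", head_status(IMAGE_URL, proxies={}))
    print("via environment:", head_status(IMAGE_URL))
```

If the direct request succeeds while the request using the environment settings fails, that would suggest the proxy configuration is involved; if both fail, the image is more likely missing from the local HTTP server at that path.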