Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-45122

"openshift-install create cluster" failed

XMLWordPrintable

    • None
    • 1
    • Metal Platform 264, Metal Platform 265
    • 2
    • False
    • Hide

      None

      Show
      None

      Description of problem:

      The ""openshift-install create cluster" failed with the following error:

      failed to fetch Cluster Infrastructure Variables: failed to fetch dependency of "Cluster Infrastructure Variables": failed to generate asset "Platform Provisioning Check": platform.baremetal.BootstrapOSImage: Not found: "http://192.168.111.1/images/rhcos-416.94.202405291527-0-qemu.x86_64.qcow2.gz?sha256=2f0a1a28aa29995106380846e9e4aefd758e57c5b17a2489137e9dddc8aeb43e"
      

      Version-Release number of selected component (if applicable):
      OPENSHIFT_VERSION=4.16.0

      How reproducible:
      90%

      Steps to Reproduce:
      1. use https://github.com/openshift-metal3/dev-scripts.git or https://github.com/openstack-k8s-operators/ci-framework.git to build a OCP environment
      2. follow the guidance to run the automation

      Actual results:
      Both auto installations fail with the same problem when run the command:"openshift-install --dir ocp/ocp --log-level=debug create cluster"
      The related log:

         +(utils.sh:182): create_cluster(): '[' '!' -z '' ']'
          +(utils.sh:194): create_cluster(): trap auth_template_and_removetmp EXIT
          +(utils.sh:195): create_cluster(): grep --line-buffered -v 'password\|X-Auth-Token\|UserData:'
          +(utils.sh:195): create_cluster(): ocp/ocp/openshift-install --dir ocp/ocp --log-level=debug create cluster
          level=debug msg=OpenShift Installer 4.16.0
          level=debug msg=Built from commit 0dc3033888f4cae22e5e5897921422f7180c4033
          level=debug msg=Fetching Metadata...
          level=debug msg=Loading Metadata...
       ...
          level=debug msg=  Fetching Platform Provisioning Check...
          level=debug msg=    Fetching Install Config...
          level=debug msg=    Reusing previously-fetched Install Config
          level=debug msg=  Generating Platform Provisioning Check...
          level=fatal msg=failed to fetch Cluster Infrastructure Variables: failed to fetch dependency of "Cluster Infrastructure Variables": failed to generate asset "Platform Provisioning Check": platform.baremetal.BootstrapOSImage: Not found: "http://192.168.111.1/images/rhcos-416.94.202405291527-0-qemu.x86_64.qcow2.gz?sha256=2f0a1a28aa29995106380846e9e4aefd758e57c5b17a2489137e9dddc8aeb43e"
          +(utils.sh:1): create_cluster(): auth_template_and_removetmp
          +(utils.sh:879): auth_template_and_removetmp(): echo 1
          +(utils.sh:880): auth_template_and_removetmp(): generate_auth_template
          +(utils.sh:327): generate_auth_template(): set +x
          error: Missing or incomplete configuration info.  Please point to an existing, complete config file:
        
        
            1. Via the command-line flag --kubeconfig
            2. Via the KUBECONFIG environment variable
            3. In your home directory as ~/.kube/config
        
          To view or setup config directly use the 'config' command.
          make: *** [Makefile:67: ocp_run] Error 1
        stdout_lines: <omitted>
      

      Expected results:
      The OCP environment can be built successfully

      Additional info:
      (1) The host I used to build the OCP environment :

       hostname : dell-per7525-23.lab.eng.pek2.redhat.com
           OS:Red Hat Enterprise Linux 9.4 (Plow)
           memory : 128G
           CPU(s):                   64
           disk:1.1T
      
      # export
           declare -x HTTPS_PROXY="squid.redhat.com:3128"
           declare -x HTTP_PROXY="squid.redhat.com:3128"
      
      # lshw -c network -businfo
      Network interfaces          
      Bus info          Device     Class          Description
      =======================================================
      pci@0000:21:00.0  ens2f0     network        NetXtreme II BCM57810 10 Gigabit Ethernet
      pci@0000:21:00.1  ens2f1     network        NetXtreme II BCM57810 10 Gigabit Ethernet
      pci@0000:e2:00.0  ens7f0     network        82599ES 10-Gigabit SFI/SFP+ Network Connection
      pci@0000:e2:00.1  ens7f1     network        82599ES 10-Gigabit SFI/SFP+ Network Connection
      pci@0000:e1:00.0  eno8303    network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      pci@0000:e1:00.1  eno8403    network        NetXtreme BCM5720 Gigabit Ethernet PCIe
      

      (2)
      This issue only be reproduced on all my "PEK2" machines currently.
      When I repeated the same steps on the "RDU2" machine, the deployment finishes successfully.

              rpittau@redhat.com Riccardo Pittau
              yanghliu@redhat.com YangHang Liu
              Gaoyun Pei Gaoyun Pei
              YangHang Liu
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated:
                Resolved: