OpenShift Bugs / OCPBUGS-10313

The agent-tui shows again during the installation

Details

    • Release Note Not Required

    Description

      Description of problem:

      The agent-tui should appear only before the installation, but it appears again during the installation, and when it quits again the installation fails to proceed.

      Version-Release number of selected component (if applicable):

      4.13.0-0.ci-2023-03-14-045458

      How reproducible:

      always

      Steps to Reproduce:

      1. Make sure the primary check passes, then boot the agent.x86_64.iso file. The agent-tui is shown before the installation starts.
      
      2. Track the installation through both the wait-for output and the console output.
      
      3. The agent-tui is shown again during the installation. Wait for the agent-tui to quit automatically, without any user interaction. The installation then quits with a failure, and the wait-for output is as follows:
      
      DEBUG asset directory: .                           
      DEBUG Loading Agent Config...                      
      ...
      DEBUG Agent Rest API never initialized. Bootstrap Kube API never initialized 
      INFO Waiting for cluster install to initialize. Sleeping for 30 seconds 
      DEBUG Agent Rest API Initialized                   
      INFO Cluster is not ready for install. Check validations 
      DEBUG Cluster validation: The pull secret is set.  
      WARNING Cluster validation: The cluster has hosts that are not ready to install. 
      DEBUG Cluster validation: The cluster has the exact amount of dedicated control plane nodes. 
      DEBUG Cluster validation: API virtual IPs are not required: User Managed Networking 
      DEBUG Cluster validation: API virtual IPs are not required: User Managed Networking 
      DEBUG Cluster validation: The Cluster Network CIDR is defined. 
      DEBUG Cluster validation: The base domain is defined. 
      DEBUG Cluster validation: Ingress virtual IPs are not required: User Managed Networking 
      DEBUG Cluster validation: Ingress virtual IPs are not required: User Managed Networking 
      DEBUG Cluster validation: The Machine Network CIDR is defined. 
      DEBUG Cluster validation: The Cluster Machine CIDR is not required: User Managed Networking 
      DEBUG Cluster validation: The Cluster Network prefix is valid. 
      DEBUG Cluster validation: The cluster has a valid network type 
      DEBUG Cluster validation: Same address families for all networks. 
      DEBUG Cluster validation: No CIDRS are overlapping. 
      DEBUG Cluster validation: No ntp problems found    
      DEBUG Cluster validation: The Service Network CIDR is defined. 
      DEBUG Cluster validation: cnv is disabled          
      DEBUG Cluster validation: lso is disabled          
      DEBUG Cluster validation: lvm is disabled          
      DEBUG Cluster validation: odf is disabled          
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Valid inventory exists for the host 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Sufficient CPU cores 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Sufficient minimum RAM 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Sufficient disk capacity 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Sufficient CPU cores for role master 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Sufficient RAM for role master 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Hostname openshift-qe-049.arm.eng.rdu2.redhat.com is unique in cluster 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Hostname openshift-qe-049.arm.eng.rdu2.redhat.com is allowed 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Speed of installation disk has not yet been measured 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host is compatible with cluster platform none 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: VSphere disk.EnableUUID is enabled for this virtual machine 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host agent compatibility checking is disabled 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: No request to skip formatting of the installation disk 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: All disks that have skipped formatting are present in the host inventory 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host is connected 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Media device is connected 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: No Machine Network CIDR needed: User Managed Networking 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host belongs to all machine network CIDRs 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host has connectivity to the majority of hosts in the cluster 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Platform PowerEdge R740 is allowed 
      WARNING Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host couldn't synchronize with any NTP server 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host clock is synchronized with service 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: All required container images were either pulled successfully or no attempt was made to pull them 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Network latency requirement has been satisfied. 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Packet loss requirement has been satisfied. 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host has been configured with at least one default route. 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Domain name resolution for the api.zniusno.arm.eng.rdu2.redhat.com domain was successful or not required 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Domain name resolution for the api-int.zniusno.arm.eng.rdu2.redhat.com domain was successful or not required 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Domain name resolution for the *.apps.zniusno.arm.eng.rdu2.redhat.com domain was successful or not required 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host subnets are not overlapping 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: No IP collisions were detected by host 7a9649d8-4167-a1f9-ad5f-385c052e2744 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: cnv is disabled 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: lso is disabled 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: lvm is disabled 
      DEBUG Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: odf is disabled 
      WARNING Host openshift-qe-049.arm.eng.rdu2.redhat.com: updated status from discovering to insufficient (Host cannot be installed due to following failing validation(s): Host couldn't synchronize with any NTP server) 
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com validation: Host NTP is synced 
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com: updated status from insufficient to known (Host is ready to be installed) 
      INFO Cluster is ready for install                 
      INFO Cluster validation: All hosts in the cluster are ready to install. 
      INFO Preparing cluster for installation           
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com: updated status from known to preparing-for-installation (Host finished successfully to prepare for installation) 
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com: New image status registry.ci.openshift.org/ocp/4.13-2023-03-14-045458@sha256:b0d518907841eb35adbc05962d4b2e7d45abc90baebc5a82d0398e1113ec04d0. result: success. time: 1.35 seconds; size: 401.45 Megabytes; download rate: 312.54 MBps 
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com: updated status from preparing-for-installation to preparing-successful (Host finished successfully to prepare for installation) 
      INFO Cluster installation in progress             
      INFO Host openshift-qe-049.arm.eng.rdu2.redhat.com: updated status from preparing-successful to installing (Installation is in progress) 
      INFO Host: openshift-qe-049.arm.eng.rdu2.redhat.com, reached installation stage Starting installation: bootstrap 
      INFO Host: openshift-qe-049.arm.eng.rdu2.redhat.com, reached installation stage Installing: bootstrap 
      INFO Host: openshift-qe-049.arm.eng.rdu2.redhat.com, reached installation stage Failed: failed executing nsenter [--target 1 --cgroup --mount --ipc --pid -- podman run --net host --pid=host --volume /:/rootfs:rw --volume /usr/bin/rpm-ostree:/usr/bin/rpm-ostree --privileged --entrypoint /usr/bin/machine-config-daemon registry.ci.openshift.org/ocp/4.13-2023-03-14-045458@sha256:f85a278868035dc0a40a66ea7eaf0877624ef9fde9fc8df1633dc5d6d1ad4e39 start --node-name localhost --root-mount /rootfs --once-from /opt/install-dir/bootstrap.ign --skip-reboot], Error exit status 255, LastOutput "...  to initialize single run daemon: error initializing rpm-ostree: Error while ensuring access to kublet config.json pull secrets: symlink /var/lib/kubelet/config.json /run/ostree/auth.json: file exists" 
      INFO Cluster has hosts in error                   
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation 
      INFO cluster has stopped installing... working to recover installation   
      
      4. During the installation, NetworkManager-wait-online.service ran for a while and then failed:
      -- Logs begin at Wed 2023-03-15 03:06:29 UTC, end at Wed 2023-03-15 03:27:30 UTC. --
      Mar 15 03:18:52 openshift-qe-049.arm.eng.rdu2.redhat.com systemd[1]: Starting Network Manager Wait Online...
      Mar 15 03:19:55 openshift-qe-049.arm.eng.rdu2.redhat.com systemd[1]: NetworkManager-wait-online.service: Main process exited, code=exited, status=1/FAILURE
      Mar 15 03:19:55 openshift-qe-049.arm.eng.rdu2.redhat.com systemd[1]: NetworkManager-wait-online.service: Failed with result 'exit-code'.
      Mar 15 03:19:55 openshift-qe-049.arm.eng.rdu2.redhat.com systemd[1]: Failed to start Network Manager Wait Online.
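
For triage, the two logs above can be summarized with a short script. The following is only a minimal sketch, not part of the installer: the helper names `status_transitions` and `seconds_between` are hypothetical, the regular expressions are assumptions based on the line shapes quoted in this report, and the year is assumed to be 2023 from the "Logs begin at" header, since the journal lines themselves do not carry it.

```python
import re
from datetime import datetime

# Assumed shape of a wait-for status line, taken from the output in this report:
#   INFO Host <name>: updated status from <old> to <new> (<reason>)
STATUS_RE = re.compile(r"Host (\S+): updated status from (\S+) to (\S+)")

# Assumed shape of the syslog-style timestamp that starts a journal line,
# for example "Mar 15 03:18:52".
STAMP_RE = re.compile(r"^\s*(\w{3} +\d+ \d{2}:\d{2}:\d{2})")


def status_transitions(lines):
    """Return (host, old_status, new_status) for every status-change line."""
    found = []
    for line in lines:
        match = STATUS_RE.search(line)
        if match:
            found.append(match.groups())
    return found


def seconds_between(first_line, last_line, year=2023):
    """Seconds elapsed between two journal lines (year supplied by the caller)."""
    def parse(line):
        stamp = STAMP_RE.match(line).group(1)
        return datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
    return (parse(last_line) - parse(first_line)).total_seconds()


if __name__ == "__main__":
    host = "openshift-qe-049.arm.eng.rdu2.redhat.com"
    waitfor = [
        f"INFO Host {host}: updated status from insufficient to known (Host is ready to be installed)",
        "INFO Cluster is ready for install",
        f"INFO Host {host}: updated status from preparing-successful to installing (Installation is in progress)",
    ]
    journal = [
        f"Mar 15 03:18:52 {host} systemd[1]: Starting Network Manager Wait Online...",
        f"Mar 15 03:19:55 {host} systemd[1]: NetworkManager-wait-online.service: Main process exited, code=exited, status=1/FAILURE",
    ]
    for name, old, new in status_transitions(waitfor):
        print(f"{name}: {old} -> {new}")
    print(seconds_between(journal[0], journal[1]))  # prints 63.0
```

On the journal excerpt above, the unit ran for 63 seconds between "Starting" and the exit with status=1/FAILURE.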

      Expected results:

      The TUI should be shown only once, before the installation.

            People

              afasano@redhat.com Andrea Fasano
              rh-ee-zniu zhenying niu
              zhenying niu zhenying niu