Uploaded image for project: 'Agent-based Installer for OpenShift'
  1. Agent-based Installer for OpenShift
  2. AGENT-588

In agent wait-for add additional info when in pending-user-action

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Normal Normal
    • None
    • openshift-4.12.z
    • None
    • False
    • Hide

      None

      Show
      None
    • False

      When in the `agent wait-for bootstrap complete` if the status is pending-user-action it would be useful to include additional status_info to help debug the problem without having to extract the user logs. For example, we had a case in which the wait-for was showing:
      INFO Bootstrap Kube API Initialized
      DEBUG Agent Rest API Initialized
      INFO Cluster has hosts requiring user input
      INFO cluster has stopped installing... working to recover installation
      INFO cluster has stopped installing... working to recover installation

      Looking at the logs we could see that the boot order was not set correctly, as indicated in the status_info

      Mar 29 10:00:51 master-0 installer[4941]: time="2023-03-29T14:00:51Z" level=info msg="Found 0 ready master nodes"
      Mar 29 10:00:51 master-0 service[2770]: time="2023-03-29T14:00:51Z" level=info msg="Register host: {HTTPRequest:0xc001a32400 DiscoveryAgentVersion:0xc000eda1a0 InfraEnvID:91cce259-da1b-4303-a91b-75110a33d339 NewHostParams:0xc001066030}" func="github.com/openshift/assisted-service/internal/bminventory.(*bareMetalInventory).V2RegisterHost" file="/src/internal/bminventory/inventory.go:4751" go-id=2017 infra_env_id=91cce259-da1b-4303-a91b-75110a33d339 pkg=Inventory request_id=707966ca-c605-4170-b3e4-e73870d76248
      Mar 29 10:00:51 master-0 service[2770]: time="2023-03-29T14:00:51Z" level=info msg="host e1589ff6-c9a6-4aa1-b310-865faa4a8b12 from infra env 91cce259-da1b-4303-a91b-75110a33d339 has been updated with the following updates [status installing-pending-user-action status_info Expected the host to boot from disk, but it booted the installation image - please reboot and fix boot order to boot from disk QEMU_HARDDISK drive-scsi0-0-0-1 (sda, /dev/disk/by-path/pci-0000:07:00.0-scsi-0:0:0:1) status_updated_at 2023-03-29T14:00:51.642Z trigger_monitor_timestamp 2023-03-29 14:00:51.64298203 +0000 UTC m=+454.575090669]" func=github.com/openshift/assisted-service/internal/host/hostutil.UpdateHostStatus file="/src/internal/host/hostutil/update_host.go:77" go-id=2017 infra_env_id=91cce259-da1b-4303-a91b-75110a33d339 pkg=host-state request_id=707966ca-c605-4170-b3e4-e73870d76248

       

      It this status_info was shown in the wait-for it would have save a lot of debug time.

       

              Unassigned Unassigned
              bfournie@redhat.com Robert Fournier
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: