Uploaded image for project: 'Agent-based deployment for OpenShift Installer'
  1. Agent-based deployment for OpenShift Installer
  2. AGENT-386

Report errors that occur prior to cluster installation

XMLWordPrintable

    • Icon: Epic Epic
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • None
    • None
    • Early error reporting
    • False
    • Hide

      None

      Show
      None
    • False
    • Not Selected
    • To Do

      OCP/Telco Definition of Done
      Epic Template descriptions and documentation.

      <--- Cut-n-Paste the entire contents of this description into your new Epic --->

      Epic Goal

      • Provide the user with detailed reports of any errors that occur during installation, with sufficient information to immediately detect and diagnose the problem - even if the error occurs prior to the step where cluster installation is started in the assisted-service.

      Why is this important?

      • Many of the common failure modes occur early in the process, and are currently detectable only from a timeout. When the failure occurs, the reason is not presented to the user and this will generate support requests. In most cases the behaviour is similar to IPI baremetal, but we aspire to better usability in the agent-based installer.

      Scenarios

      1. Unable to pull container images to run services
      2. Systemd services fail to start because of reasons (e.g. no host detects rendezvous IP)
      3. Host-specific configurations are not successfully applied (i.e. apply-host-config service does not complete, and errors are present in /var/run/agent-installer/host-config-failures)
      4. Insufficient agents are booted to begin the cluster installation (i.e. start-cluster-installation does not start the cluster installation).

      Acceptance Criteria

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.
      • ...

      Dependencies (internal and external)

      1. ...

      Previous Work (Optional):

      1. Discussions on AGENT-172

      Open questions::

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build attached to advisory: <link to errata>
      • QE - Test plans in Polarion: <link or reference to Polarion>
      • QE - Automated tests merged: <link or reference to automated tests>
      • DOC - Downstream documentation merged: <link to meaningful PR>

            Unassigned Unassigned
            zabitter Zane Bitter
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated: