Ansible Automation Platform RFEs / AAPRFE-186

Better failure handling of running jobs on execution nodes

      What is the nature and description of the request?

      Currently, when a job fails on an execution node, it must be relaunched manually, and the failure may leave a managed node in an inconsistent state. This request asks for a way for Controller to make a best effort to prevent managed nodes from being left in an inconsistent state when a running job fails partway through its run.

      Why does the customer need this? (List the business requirements here)

      If a job fails, it can leave the managed node in an undesired or inconsistent state, depending on where in the run it failed.

      How would you like to achieve this? (List the functional requirements here)

      The customer would like a job that is in progress to be picked up by another execution node within the same instance group, if one is available.

      Unsure if there are other ways to go about this.

            chadwickferman Chad Ferman
            Votes: 1
            Watchers: 3