  1. Red Hat OpenStack Services on OpenShift
  2. OSPRH-1340

As a cloud operator with 200 external data plane hosts, I want to troubleshoot failures in Ansible execution on the data plane quickly and easily, so that I can zero in on the failure and resolve it.


    • Proposed
    • Committed
    • To Do
    • RHOSSTRAT-270 - Red Hat OpenStack 18.0 Greenfield Deployment
    • Proposed
    • No impact
    • 0% To Do, 0% In Progress, 100% Done
    • Rejected
    • 2023Q2

      Answer the following questions:

      • What is the user experience of troubleshooting when one node on the data plane is down?
      • What is the user experience of troubleshooting when a service on a data plane node is broken in an unexpected way (e.g. libvirt will not come up)?
      • How long does the execution of the data plane work take? Does it scale linearly with the number of nodes, or in some other way? Measure each of the following:
        • Initial bootstrap/install for the whole data plane
        • Re-run of bootstrap/install after resolving a failure on a subset of data plane hosts
        • A configuration change that applies to the whole data plane (test one change implemented via the nova operator and one via the data plane operator)
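
One possible way to answer the scaling question is to time the same run at several node counts and estimate the exponent k in duration ≈ c · nodes^k from a log-log fit. The sketch below is only an illustration: the function name `scaling_exponent` is hypothetical, and the sample numbers are placeholders, not measurements from any deployment.

```python
import math


def scaling_exponent(samples):
    """Estimate k in duration ~ c * nodes**k.

    `samples` is a list of (node_count, duration_seconds) pairs. The result is
    the least-squares slope of log(duration) against log(node_count).
    """
    xs = [math.log(nodes) for nodes, _ in samples]
    ys = [math.log(seconds) for _, seconds in samples]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct node counts")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    # Placeholder values for illustration only; replace with real timings.
    placeholder_samples = [(10, 60.0), (50, 300.0), (200, 1200.0)]
    print(round(scaling_exponent(placeholder_samples), 2))
```

An exponent close to 1 would suggest roughly linear scaling with node count, a value well below 1 would suggest that the work is largely parallel, and a value above 1 would suggest that per-node cost grows as the data plane gets larger. The same fit can be repeated for each of the three scenarios listed above.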

      Suggest improvements that strike a good balance between execution time and usability.

              pweeks@redhat.com Phillip Weeks
              rhn-engineering-jpretori Jesse Pretorius
              rhos-dfg-df
