Uploaded image for project: 'OpenStack Strategy'
  1. OpenStack Strategy
  2. RHOSSTRAT-1163

Provide a shared-nothing architecture for RHOSO

XMLWordPrintable

    • Icon: Feature Feature
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • None
    • PIDONE
    • None
    • Not Selected
    • False
    • False
    • Hide

      None

      Show
      None
    • 0
    • rhos-docs

      Feature Overview 

      Deployment of separate MariaDB Galera and RabbitMQ clusters for each OpenStack service is desired in some scenarios where the scale or deployment of the OpenStack environment would justify the additional resource usage.

      This allows: 

      • Impact reduction when a Galera cluster is down.
      • Intervention on a focused part of the infrastructure.
      • Better distribution of a large number of requests.

      Goals

      Provide a shared-nothing architecture for Red Hat OpenStack Services on OpenShift (RHOSO) to eliminate performance bottlenecks and single points of failure at scale.

      • Create automated testing, including validated architecture enablement.
      • Validate the shared-nothing architecture works at an appropriate scale.

      Provide documentation to allow the deployment of service-aligned deployments of the database and message bus.

      • Create a knowledge base article that broadly captures the general requirements and procedure to implement multiple database and message bus deployments, aligned to each deployed OpenStack service.
      • Create official documentation in the OpenStack guide.

      Requirements

      Requirement Notes isMVP?
      Creation of base KBA providing general guidance about how to implement service-aligned databases and message queues. Intended to be provided in the short term. Yes
      Creation of documentation in the OpenStack guide providing the appropriate content such as concept, procedure and debugging modules. Will be provided in a future feature release. Yes
      Creation of automated testing interfaces to allow validation of a shared-nothing architecture for each release. Implement the required amount of automated testing to provide confidence in the delivery of the shared-nothing architecture. Yes
      Implementation of a validated architecture deployment for shared-nothing architecture providing a viewpoint of how the enablement should be done. Implement or expand an existing validated architecture so that the shared-nothing architecture can be referenced. For greenfield deployments that are not being upgraded or adopted, the deployment of a shared-nothing architecture may be the preferred default. Yes

       
      Done - Acceptance Criteria 

      A knowledge base article (KBA) is available for consumption by appropriate end users that need access to the deployment in the short term. Any usage of the KBA information in production will likely require a support exception to allow engineering review and considerations.

      This feature will be completed when the functionality has been automated, validated, and documented in the OpenStack guide and made generally available for use.

      Use Cases - i.e. User Experience & Workflow: 

      [Use Case] A partner has created a large scale deployment of RHOSO, with 900 compute nodes and extensive workloads for the first cluster. The partner wishes to achieve 1500 compute nodes for the second RHOSO cluster. In their current Legacy Cloud, the partner has a configuration with one Database cluster per OpenStack service.

      [Workflow] The OpenStack administrator will enable the deployment of per-service infrastructure by configuring the RHOSO environment using the standard CustomResource interfaces already provided, along with per-service secrets and other potential per-service objects.

      Out of Scope

      No additional development should be necessary to implement this feature as it is expected the deployment and configuration will be possible with the current set of CustomResourceDefinitions provided by RHOSO 18.

      It may not be possible to transition from a shared architecture to a shared-nothing architecture. Needs validation.

      Documentation Considerations

      An initial KBA will be provided that gives an example configuration and basic procedures allowing advanced RHOSO administrators to deploy per-service infrastructure (database, message bus). Engineering review will be all that is required to ship this KBA.

      A separate mini-content journey that explores a more complete set of documentation will also need to be created and managed as a separate delivery effort targeting a future feature release.

      On Upstream side, Large Scale SIG group has already done tests and is recommending  to split the Galera cluster into separate clusters, here [1]

      [1]: https://docs.openstack.org/large-scale/journey/configure/database.html

      Questions to Answer

      • Should this Feature be scoped larger to include automated testing or be limited to a documentation-only feature?
        • if this feature is documentation-only, how should automated testing for this architecture be tracked?
        • could create an Outcome for this overall effort and have separate Features / Initiatives that track the documentation delivery separate of automated testing.
        • YES this feature is now expanded to include the overall delivery of this functionality including automated testing and not just documentation.

      Background and Strategic Fit

      Some customers already have a shared-nothing architecture deployment. In some cases the scale or architectural decisions for some RHOSO environments justifies the additional resource usage for segregated database and message bus interfaces for each OpenStack service.

      Customer Considerations

      The customer should be able to follow this procedure as part of their RHOSO deployment planning.

      Team Sign Off (Completion while in Planning status)

      • All required Epics (known at the time) are linked to the this Feature
      • All required Stories, Tasks (known at the time) for the most immediate Epics have been created and estimated
      • Add - Reviewers name, Team Name
      • Acceptance == Feature as “Ready” - well understood and scope is clear - Acceptance Criteria (scope) is elaborated, well defined, and understood
      • Note: Only set FixVersion/s: on a Feature if the delivery team agrees they have the capacity and have committed that capability for that milestone
        Reviewed By Team Name Accepted Notes
        Engineering (dataplane) rhos-ops-day1day2-edpm    
        Engineering (control plane) rhos-conplat-core-operators    
        Engineering (message bus, database) rhos-ops-platform-services-pidone    
        Documentation rhos-docs    
        Product Management Gil Rosenberg    

              mariel@redhat.com Mikey Ariel
              smsallem@redhat.com Soumaya Msallem
              Gil Rosenberg, Sean Mooney
              Gil Rosenberg Gil Rosenberg
              Edu Alcaniz Edu Alcaniz
              rhos-docs
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated: