  1. OpenShift Container Platform (OCP) Strategy
  2. OCPSTRAT-2024

Consolidate Control Plane Recovery Docs

      Feature Overview

      In 4.18 we introduced automated recovery of the control plane nodes: when the cluster loses quorum because control plane nodes fail, running a recovery script restores quorum almost immediately, before the failed nodes are brought back:

      https://docs.redhat.com/en/documentation/openshift_container_platform/4.18/html/backup_and_restore/control-plane-backup-and-restore#dr-quorum-restoration 

      This process replaces a series of manual steps that are no longer needed now that the automation exists. The manual steps are still documented:

      https://docs.redhat.com/en/documentation/openshift_container_platform/4.17/html/backup_and_restore/control-plane-backup-and-restore#dr-restoring-cluster-state 

      The manual procedure spans 24 sections with a total of 75 steps, which makes it more prone to human error than the new procedure.

      Goals 

      • Validate that the manual process covers no use case that the automated process fails to cover.
      • Remove the manual process once we confirm that it is no longer needed and can be replaced entirely by the new method.

              racedoro@redhat.com Ramon Acedo
              Matthew Werner
              Kyle Walker