  1. OpenShift Etcd
  2. ETCD-656

Automate datadir move after quorum-restore


    • Type: Story
    • Resolution: Done
    • BU Product Work
    • 2
    • OCPSTRAT-539 - Enhance recovery procedure for full control plane failure
    • ETCD Sprint 259, ETCD Sprint 260

      After running the quorum restore script, we want to bring the other members back into the cluster automatically.

      Currently the init container in 

      https://github.com/openshift/etcd/blob/openshift-4.17/openshift-tools/pkg/discover-etcd-initial-cluster/initial-cluster.go

      guards that case by checking whether the member is already part of the cluster and has an empty datadir.

      We need to adjust this check to test whether the cluster ID of the currently configured member and the cluster ID recorded in the current datadir refer to the same cluster.

      When we detect a mismatch, we can assume the cluster was recovered by quorum restore, and we can attempt to move the datadir so that the member automatically joins the cluster again.
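
      The proposed check could be sketched roughly as below. This is a minimal illustration, not the code in initial-cluster.go: the helper names needsDataDirMove and moveDataDirAside, the backup naming scheme, and the example IDs are all hypothetical, and how the two cluster IDs are actually read (from the running cluster and from the on-disk datadir) is deliberately left out. It also assumes the datadir can be renamed in place, which may not hold if it is a mount point.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
	"time"
)

// needsDataDirMove is a hypothetical helper. It reports whether the local
// datadir appears to belong to a different cluster than the one the member
// is currently configured for. Obtaining the two IDs is out of scope here.
func needsDataDirMove(configuredClusterID, dataDirClusterID uint64, dataDirEmpty bool) bool {
	if dataDirEmpty {
		// Nothing on disk, so there is nothing to move.
		return false
	}
	return configuredClusterID != dataDirClusterID
}

// moveDataDirAside is a hypothetical helper. It renames dataDir to a
// timestamped sibling directory and returns the new path.
func moveDataDirAside(dataDir string, now time.Time) (string, error) {
	backup := fmt.Sprintf("%s-backup-%s",
		filepath.Clean(dataDir), now.UTC().Format("20060102T150405Z"))
	if err := os.Rename(dataDir, backup); err != nil {
		return "", fmt.Errorf("moving %s to %s: %w", dataDir, backup, err)
	}
	return backup, nil
}

func main() {
	// Build a throwaway directory that stands in for a stale datadir.
	root, err := os.MkdirTemp("", "etcd-sketch")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(root)

	dataDir := filepath.Join(root, "member-data")
	if err := os.MkdirAll(filepath.Join(dataDir, "member"), 0o700); err != nil {
		panic(err)
	}

	// Made-up IDs: the configured cluster differs from the one on disk.
	const configuredID, onDiskID uint64 = 0xcafe, 0xbeef

	if needsDataDirMove(configuredID, onDiskID, false) {
		backup, err := moveDataDirAside(dataDir, time.Now())
		if err != nil {
			panic(err)
		}
		fmt.Println("moved stale datadir to", backup)
	}
}
```

      Renaming rather than deleting keeps the old data available for inspection if the mismatch turns out to have another cause.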

            tjungblu@redhat.com Thomas Jungblut