Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-63473

Rebase etcd to 3.5.24

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • 4.21.0
    • 4.19.z, 4.20.0, 4.21.0
    • Etcd
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • Rejected
    • None
    • Done
    • Bug Fix
    • Hide
      Before this update, during rolling cluster updates from etcd 3.5.19 to a release of 3.6, the wrong membership data could be propagated to new members. As a consequence, cluster updates failed with an error about too many learner members in the cluster. With this release, etcd is updated to 3.5.24, which includes fixes so that the membership-related errors no longer occur.
      Show
      Before this update, during rolling cluster updates from etcd 3.5.19 to a release of 3.6, the wrong membership data could be propagated to new members. As a consequence, cluster updates failed with an error about too many learner members in the cluster. With this release, etcd is updated to 3.5.24, which includes fixes so that the membership-related errors no longer occur.
    • None
    • None
    • None
    • None

      Description of problem:

          During rolling cluster upgrades from etcd 3.5.19 to 3.6.x, a membership data propagation issue occurs where wrong membership data may be propagated to new members during snapshot
        transfers. This results in cluster upgrade failures with the error etcdserver: too many learner members in cluster.

      Version-Release number of selected component (if applicable):

          4.19-4.21

      How reproducible:

          Consistently reproducible during multi-stage cluster upgrades involving etcd <=3.5.19 and subsequent upgrades to 3.6.x when adding learner members.

      Steps to Reproduce:

        1. Upgrade cluster from a version using etcd 3.5.19 to etcd 3.5.21
        2. Attempt subsequent upgrade to etcd 3.6.4 (or other 3.6.x version)
        3. Add new learner members during the upgrade process
        4. Observe membership data propagation during snapshot transfer     

      Actual results:

          Wrong membership data is propagated to new members during snapshot transfers, causing the error etcdserver: too many learner members in cluster and blocking the cluster upgrade from
        completing successfully.

      Expected results:

          Membership data should be correctly propagated to new members during snapshot transfers, allowing the cluster upgrade to complete successfully without membership-related errors.

      Additional info:

          Upstream issue: https://github.com/etcd-io/etcd/issues/20793

              alray@redhat.com Allen Ray
              alray@redhat.com Allen Ray
              None
              None
              Ge Liu Ge Liu
              None
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

                Created:
                Updated: