Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-65540

pcs debug-start can fail due to node already being marked as a learner

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • None
    • 4.21
    • Two Node Fencing
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • Rejected
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      During a node replacement test running in CI, we run pcs debug stop and pcs debug start to force the resource agent to start without needed to fence the node. For some reason in CI, this hits an error on startup where the dead node is already marked as a learner, so when the survivor tries to start up, it fails when it tries to add the failed node as a learner (since it's already there).
      
      

      Steps to Reproduce:

      I was able to reproduce this in CI only by using debug-stop and debug-start
      
      

      Actual results:

      etcd fails to start on the survivor (cannot add learner member)

      Expected results:

      etcd starts on the survivor

      Additional info:

       

              rh-ee-clobrano Carlo Lobrano
              jpoulin Jeremy Poulin
              None
              None
              Douglas Hensel Douglas Hensel
              None
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: