Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-29315

Observed CPMS with 2 masters with index 1 and none with index 2

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Undefined Undefined
    • None
    • 4.15
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • No
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      During a CPMS rollout I saw the following state:

      [cloud-user@emilien-test ~]$ oc -n openshift-machine-api get machines.m
      NAME                            PHASE      TYPE                    REGION      ZONE   AGE
      ocp-foch-2qqxj-master-4p5l9-1   Deleting   m1.xlarge.noephemeral   regionOne   nova   121m
      ocp-foch-2qqxj-master-7qsv5-1   Running    m1.xlarge               regionOne   nova   15m
      ocp-foch-2qqxj-master-nnhpq-2   Deleting   m1.xlarge.noephemeral   regionOne   nova   100m
      ocp-foch-2qqxj-master-zwt4q-0   Running    m1.xlarge               regionOne   nova   32m
      ocp-foch-2qqxj-worker-0-mmbn4   Running    m1.large                regionOne   nova   12h
      
      [cloud-user@emilien-test ~]$ oc get node
      NAME                            STATUS   ROLES                  AGE    VERSION
      ocp-foch-2qqxj-master-4p5l9-1   Ready    control-plane,master   118m   v1.28.6+f1618d5
      ocp-foch-2qqxj-master-7qsv5-1   Ready    control-plane,master   12m    v1.28.6+f1618d5
      ocp-foch-2qqxj-master-nnhpq-2   Ready    control-plane,master   98m    v1.28.6+f1618d5
      ocp-foch-2qqxj-master-zwt4q-0   Ready    control-plane,master   29m    v1.28.6+f1618d5
      ocp-foch-2qqxj-worker-0-mmbn4   Ready    worker                 12h    v1.28.6+f1618d5
      

      Shortly after I captured this the ocp-foch-2qqxj-master-nnhpq-2 node was deleted, leaving the cluster with 3 nodes:

      • ocp-foch-2qqxj-master-4p5l9-1
      • ocp-foch-2qqxj-master-7qsv5-1
      • ocp-foch-2qqxj-master-zwt4q-0

      I was not intentionally running any concurrent operation at the time.

      Time was approximately 16:29 GMT on 2024-09-09.

      I have attached a must-gather.

              joelspeed Joel Speed
              rhn-gps-mbooth Matthew Booth
              None
              None
              Zhaohua Sun Zhaohua Sun
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: