-
Bug
-
Resolution: Duplicate
-
Undefined
-
None
-
4.15
-
None
-
Quality / Stability / Reliability
-
False
-
-
None
-
None
-
No
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
-
None
During a CPMS rollout I saw the following state:
[cloud-user@emilien-test ~]$ oc -n openshift-machine-api get machines.m NAME PHASE TYPE REGION ZONE AGE ocp-foch-2qqxj-master-4p5l9-1 Deleting m1.xlarge.noephemeral regionOne nova 121m ocp-foch-2qqxj-master-7qsv5-1 Running m1.xlarge regionOne nova 15m ocp-foch-2qqxj-master-nnhpq-2 Deleting m1.xlarge.noephemeral regionOne nova 100m ocp-foch-2qqxj-master-zwt4q-0 Running m1.xlarge regionOne nova 32m ocp-foch-2qqxj-worker-0-mmbn4 Running m1.large regionOne nova 12h [cloud-user@emilien-test ~]$ oc get node NAME STATUS ROLES AGE VERSION ocp-foch-2qqxj-master-4p5l9-1 Ready control-plane,master 118m v1.28.6+f1618d5 ocp-foch-2qqxj-master-7qsv5-1 Ready control-plane,master 12m v1.28.6+f1618d5 ocp-foch-2qqxj-master-nnhpq-2 Ready control-plane,master 98m v1.28.6+f1618d5 ocp-foch-2qqxj-master-zwt4q-0 Ready control-plane,master 29m v1.28.6+f1618d5 ocp-foch-2qqxj-worker-0-mmbn4 Ready worker 12h v1.28.6+f1618d5
Shortly after I captured this the ocp-foch-2qqxj-master-nnhpq-2 node was deleted, leaving the cluster with 3 nodes:
- ocp-foch-2qqxj-master-4p5l9-1
- ocp-foch-2qqxj-master-7qsv5-1
- ocp-foch-2qqxj-master-zwt4q-0
I was not intentionally running any concurrent operation at the time.
Time was approximately 16:29 GMT on 2024-09-09.
I have attached a must-gather.