Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-2139

Pods not getting migrated when the master node is powered off

    XMLWordPrintable

Details

    • Bug
    • Resolution: Not a Bug
    • Undefined
    • None
    • 4.11.0
    • Node / Kubelet
    • Moderate
    • False
    • Hide

      None

      Show
      None
    • If Release Note Needed, Set a Value
    • Set a Value

    Description

      Description of problem:
      Pods not getting migrated when the master node is powered off.

      Version-Release number of selected component (if applicable):

      How reproducible:

      Steps to Reproduce:
      1.Created a OCP cluster with 3 master/worker nodes
      2.Deployed a pod and it has been running node-3
      3.Performed power off operation on the node-3

      [amk@amk ~]$ oc get nodes
      NAME STATUS ROLES AGE VERSION
      cephqe-node1.lab.eng.blr.redhat.com Ready master,worker 27d v1.24.0+b62823b
      cephqe-node2.lab.eng.blr.redhat.com Ready master,worker 27d v1.24.0+b62823b
      cephqe-node3.lab.eng.blr.redhat.com NotReady master,worker 27d v1.24.0+b62823b
      [amk@amk ~]$ oc version
      Client Version: 4.11.0-0.nightly-2022-07-11-080250
      Kustomize Version: v4.5.4
      Server Version: 4.11.0-0.nightly-2022-09-02-184920
      Kubernetes Version: v1.24.0+b62823b
      [amk@amk ~]$ oc get pods -n busybox-sampl
      Unable to connect to the server: x509: certificate is valid for kubernetes, kubernetes.default, kubernetes.default.svc, kubernetes.default.svc.cluster.local, openshift, openshift.default, openshift.default.svc, openshift.default.svc.cluster.local, 172.30.0.1, not api.ocp-mr-2308-1.ceph-qe.rh-ocs.com
      [amk@amk ~]$ oc get pods -n busybox-sample
      NAME READY STATUS RESTARTS AGE
      busybox 1/1 Terminating 1 14d
      busybox-1 0/1 Terminating 0 11d
      busybox-2 1/1 Terminating 1 11d
      busybox-3 1/1 Terminating 1 11d
      busybox-4 1/1 Terminating 1 11d
      [amk@amk ~]$

      Actual results:
      Pod did not migrate and pod moved to Terminating state.

      Expected results:
      Pod should migrate to other nodes and pod should not fail.

      Additional info:
      must gather Logs are placed at : http://magna002.ceph.redhat.com/ceph-qe-logs/amar/ocp_node_down/must-gather.local.2396732042640378866/

      Attachments

        Issue Links

          Activity

            People

              rphillip@redhat.com Ryan Phillips
              rhn-support-amk Amarnath reddy
              Sunil Choudhary Sunil Choudhary
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: