OpenShift Bugs / OCPBUGS-14918

Nodes shown as duplicates in oc get nodes

    • Important
    • No
    • Rejected
    • False
    • 7/6: telco review pending field eng input (GW)

      Description of problem:

      Partner is seeing duplicate nodes in the output of oc get nodes.
      This has happened a few times in their lab while installing clusters.
      hostnamectl on the node shows a transient hostname of master-2 and an empty static hostname.
      
      [kni@ztk-installer ~]$ oc get node
      NAME                           STATUS     ROLES                                             AGE   VERSION
      master-0.vcp2.companyLab.com   Ready      control-plane,master,r740-26c-std2-combo,worker   12d   v1.25.7+eab9cc9
      master-1.vcp2.companyLab.com   Ready      control-plane,master,r740-26c-std2-combo,worker   12d   v1.25.7+eab9cc9
      master-2                       NotReady   control-plane,master,worker                       5s    v1.25.7+eab9cc9
      master-2.vcp2.companyLab.com   NotReady   control-plane,master,r740-26c-std2-combo,worker   12d   v1.25.7+eab9cc9
      
      -- etcd
      
      $ oc rsh -n openshift-etcd etcd-master-1.vcp2.companyLab.com
      sh-4.4# etcdctl member list -w table
      +------------------+---------+------------------------------+--------------------------+--------------------------+------------+
      |        ID        | STATUS  |             NAME             |        PEER ADDRS        |       CLIENT ADDRS       | IS LEARNER |
      +------------------+---------+------------------------------+--------------------------+--------------------------+------------+
      | 3b814ebe29d47108 | started | master-2.vcp2.companyLab.com | https://10.73.11.77:2380 | https://10.73.11.77:2379 |      false |
      | d7976302434ccc8a | started | master-0.vcp2.companyLab.com | https://10.73.11.75:2380 | https://10.73.11.75:2379 |      false |
      | ebe35e8f3519226e | started | master-1.vcp2.companyLab.com | https://10.73.11.76:2380 | https://10.73.11.76:2379 |      false |
      +------------------+---------+------------------------------+--------------------------+--------------------------+------------+
      
      
      [root@master-2 core]# hostnamectl
         Static hostname: n/a
      Transient hostname: master-2
               Icon name: computer-server
                 Chassis: server
              Machine ID: 501df91049534f84be3d7b8e69ddd985
                 Boot ID: 9827c56620fc43c59af77ba577f66357
        Operating System: Red Hat Enterprise Linux CoreOS 412.86.202303241612-0 (Ootpa)
             CPE OS Name: cpe:/o:redhat:enterprise_linux:8::coreos
                  Kernel: Linux 4.18.0-372.49.1.el8_6.x86_64
            Architecture: x86-64
      [root@master-2 core]# cat /etc/hosts
      127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
      ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
      
      
      [root@master-2 core]# cat /etc/hostname
      cat: /etc/hostname: No such file or directory
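
The hostnamectl output above shows the condition reported here: no static hostname, only a transient one. A minimal sketch of a check for that condition follows. The function name static_hostname_set is hypothetical, and treating "(unset)" the same as "n/a" is an assumption about how other systemd versions may print an empty value.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of any OpenShift tooling).
# Reads `hostnamectl` output on stdin and returns 0 only when a
# static hostname is set.
static_hostname_set() {
  awk -F': *' '
    /^[[:space:]]*Static hostname:/ {
      value = $2
      gsub(/[[:space:]]+$/, "", value)   # drop trailing whitespace
      found = 1
    }
    END {
      # "n/a" is what this report shows; "(unset)" is an assumed variant.
      ok = (found && value != "" && value != "n/a" && value != "(unset)")
      exit !ok
    }
  '
}
```

On a node it could be used as `hostnamectl | static_hostname_set || echo "static hostname is not set"`.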

      Version-Release number of selected component (if applicable):

      4.12

      How reproducible:

      not sure

      Steps to Reproduce:

      1. Install OCP 4.12.x.
      2. Run oc get nodes.
      3. Check for a short-name duplicate of an existing node; it appears only intermittently.
      

      Actual results:

      you get:
      node-1.domain.com Ready .... 12Days
      node-2.domain.com Ready .... 12Days
      node-3.domain.com NotReady .... 12Days
      node-3            Ready ...... 10s (or 5m ...etc.)
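
The pattern in the listing above (a short node name next to the FQDN node with the same first label) can be spotted mechanically. This is a rough sketch that only parses the first column of oc get nodes style output. The function name find_shadow_nodes is hypothetical, and matching on the first DNS label is an assumption about how the duplicate is named.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print "<short-name> <fqdn>" for every short node
# name that matches the first label of an FQDN node name.
# Reads `oc get nodes` style output on stdin.
find_shadow_nodes() {
  awk '
    NR == 1 && $1 == "NAME" { next }   # skip the header row if present
    NF == 0 { next }                   # skip blank lines
    {
      name = $1
      if (index(name, ".") > 0) {
        split(name, parts, ".")
        fqdn_for[parts[1]] = name      # remember the FQDN by first label
      } else {
        short[name] = 1                # name without any domain part
      }
    }
    END {
      for (s in short)
        if (s in fqdn_for)
          print s, fqdn_for[s]
    }
  ' | sort
}
```

Against a live cluster it could be run as `oc get nodes | find_shadow_nodes`; no output means no short-name duplicate was found.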
      
      

      Expected results:

      node-1.domain.com Ready .... 12Days
      node-2.domain.com Ready .... 12Days
      node-3.domain.com NotReady .... 12Days

      Additional info:

      The workaround is to delete the duplicate short-name node (oc delete node node-3)
      and then reboot the physical node.
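
The cluster-side half of the workaround could be wrapped as below so that the FQDN node is never deleted by mistake. This is only a sketch: the function name delete_short_node and the DRY_RUN variable are hypothetical, and the reboot of the physical node is left as a manual step.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around the workaround from this report.
# With DRY_RUN=1 (the default here) it only prints the command.
delete_short_node() {
  local name="$1"
  if [ -z "$name" ]; then
    echo "usage: delete_short_node <short-node-name>" >&2
    return 2
  fi
  # Refuse anything containing a dot so the real FQDN node is kept.
  case "$name" in
    *.*) echo "refusing to delete FQDN node: $name" >&2; return 1 ;;
  esac
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "oc delete node $name"
  else
    oc delete node "$name"
  fi
  # Rebooting the physical node afterwards is not automated here.
}
```

For example, `delete_short_node node-3` prints the command that would run, and `DRY_RUN=0 delete_short_node node-3` runs it.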

            Team MCO
            Lazhar Halleb (lhalleb@redhat.com)
            Sergio Regidor de la Rosa