-
Bug
-
Resolution: Done-Errata
-
Major
-
None
-
4.12
Description of problem:
Partner is seeing a duplicate nodes in the oc get nodes commands. This has happened a few times in their lab while installing clusters hostnamectl on the node show : transient hostname: master-2 and static is empty kni@ztk-installer ~]$ oc get node NAME STATUS ROLES AGE VERSION master-0.vcp2.companyLab.com Ready control-plane,master,r740-26c-std2-combo,worker 12d v1.25.7+eab9cc9 master-1.vcp2.companyLab.com Ready control-plane,master,r740-26c-std2-combo,worker 12d v1.25.7+eab9cc9 master-2 NotReady control-plane,master,worker 5s v1.25.7+eab9cc9 master-2.vcp2.companyLab.com NotReady control-plane,master,r740-26c-std2-combo,worker 12d v1.25.7+eab9cc9 -- etcd $ oc rsh -n openshift-etcd etcd-master-1.vcp2.companyLab.com sh-4.4# etcdctl member list -w table +------------------+---------+-----------------------------+--------------------------+--------------------------+------------+ | ID | STATUS | NAME | PEER ADDRS | CLIENT ADDRS | IS LEARNER | +------------------+---------+-----------------------------+--------------------------+--------------------------+------------+ | 3b814ebe29d47108 | started | master-2.vcp2.companyLab.com | https://10.73.11.77:2380 | https://10.73.11.77:2379 | false | | d7976302434ccc8a | started | master-0.vcp2.companyLab.com | https://10.73.11.75:2380 | https://10.73.11.75:2379 | false | | ebe35e8f3519226e | started | master-1.vcp2.companyLab.com | https://10.73.11.76:2380 | https://10.73.11.76:2379 | false | +------------------+---------+-----------------------------+--------------------------+--------------------------+------------+ [root@master-2 core]# hostnamectl Static hostname: n/a Transient hostname: master-2 Icon name: computer-server Chassis: server Machine ID: 501df91049534f84be3d7b8e69ddd985 Boot ID: 9827c56620fc43c59af77ba577f66357 Operating System: Red Hat Enterprise Linux CoreOS 412.86.202303241612-0 (Ootpa) CPE OS Name: cpe:/o:redhat:enterprise_linux:8::coreos Kernel: Linux 4.18.0-372.49.1.el8_6.x86_64 Architecture: x86-64 [root@master-2 core]# cat /etc/hosts 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 ::1 localhost localhost.localdomain localhost6 localhost6.localdomain6 [root@master-2 core]# cat /etc/hostname cat: /etc/hostname: No such file or directory
Version-Release number of selected component (if applicable):
4.12
How reproducible:
not sure
Steps to Reproduce:
1. install OCP 4.12.x 2. run oc get nodes 3. good luck
Actual results:
you get: node-1.domain.com Ready .... 12Days node-2.domain.com Ready .... 12Days node-3.domain.com NotReady .... 12Days node-3 Ready ...... 10s (or 5m ...etc.)
Expected results:
node-1.domain.com Ready .... 12Days node-2.domain.com Ready .... 12Days node-3.domain.com NotReady .... 12Days
Additional info:
the workaround is: delete node-3 node (oc delete node node-3) and reboot the physical node
- is related to
-
OCPBUGS-14692 rebooted nodes return with long fqdns name, and create duplicates workers
- Closed
- links to
-
RHSA-2023:5006 OpenShift Container Platform 4.14.z security update