-
Bug
-
Resolution: Done
-
Undefined
-
4.20
-
Quality / Stability / Reliability
-
False
-
-
None
-
Critical
-
None
-
7/2: Fixed and merged in 4.20
Description of problem:
linuxptp daemonset pods get stuck in ContainerCreating because the secret linuxptp-daemon-secret is missing.
Version-Release number of selected component (if applicable):
OCP: 4.20.0-ec.3
PTP Operator: 4.20.0-202506242215 (pre-ga)
How reproducible:
50%: of 4 daily CI jobs, 2 failed with this error and 2 passed.
Steps to Reproduce:
1. Deploy OCP 4.20.0-ec.3 on bare-metal nodes.
2. Deploy PTP Operator 4.20.0-202506242215 (pre-ga).
3. Check the daemonset and pods in the openshift-ptp namespace. About half of the time the pods remain in ContainerCreating because the secret linuxptp-daemon-secret is missing.
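The check in step 3 could be scripted for CI along the following lines. This is only a sketch: the helper name count_stuck_pods is hypothetical (not part of the operator or of oc), and it assumes the default table layout of `oc get pods`, where STATUS is the third column.

```shell
#!/usr/bin/env bash
# count_stuck_pods (hypothetical helper): reads `oc get pods` table output on
# stdin and prints how many linuxptp-daemon pods are in ContainerCreating.
# Assumes the default column order: NAME READY STATUS RESTARTS AGE.
count_stuck_pods() {
  awk '$1 ~ /^linuxptp-daemon-/ && $3 == "ContainerCreating" { n++ }
       END { print n + 0 }'
}

# Possible usage against a live cluster (requires a logged-in oc session):
#   oc -n openshift-ptp get pods --no-headers | count_stuck_pods
```

A non-zero count some minutes after the operator is installed would match the failure described in this report.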
Actual results:
linuxptp daemonset pods are not running about half of the time, and the secret "linuxptp-daemon-secret" is not created.
Expected results:
linuxptp daemonset pods should be running 100% of the time, and the secret "linuxptp-daemon-secret" should always be created.
Additional info:
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc get clusterversion
NAME VERSION AVAILABLE PROGRESSING SINCE STATUS
version 4.20.0-ec.3 True False 5h26m Cluster version is 4.20.0-ec.3
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc get nodes
NAME STATUS ROLES AGE VERSION
master-0 Ready control-plane,master 6h39m v1.32.5
master-1 Ready control-plane,master 6h40m v1.32.5
master-2 Ready control-plane,master 6h40m v1.32.5
worker-0 Ready worker 5h37m v1.32.5
worker-1 Ready worker 5h36m v1.32.5
worker-2 Ready worker 5h35m v1.32.5
worker-3 Ready worker 5h36m v1.32.5
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get daemonset
NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
linuxptp-daemon 7 7 0 7 0 kubernetes.io/os=linux 4h30m
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get pods
NAME READY STATUS RESTARTS AGE
linuxptp-daemon-45fcb 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-gd6ss 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-gsp8z 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-hk68z 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-ph9lb 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-ww6zk 0/2 ContainerCreating 0 4h30m
linuxptp-daemon-xkv5t 0/2 ContainerCreating 0 4h30m
ptp-operator-6d6d47dbd6-v5qph 1/1 Running 1 (4h27m ago) 4h30m
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp describe pod linuxptp-daemon-45fcb | tail -4
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Warning FailedMount 3m43s (x139 over 4h30m) kubelet MountVolume.SetUp failed for volume "linuxptp-certs" : secret "linuxptp-daemon-secret" not found
[kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get secrets
NAME TYPE DATA AGE
ptp-operator-service-cert kubernetes.io/tls 3 4h30m
webhook-server-cert kubernetes.io/tls 2 4h30m
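The secret listing above can also be checked mechanically. A minimal sketch follows, assuming the default `oc get secrets` table layout with the secret name in the first column; the helper name has_secret is hypothetical.

```shell
#!/usr/bin/env bash
# has_secret (hypothetical helper): returns 0 if the secret named in $1
# appears in the `oc get secrets` table output read from stdin, else 1.
has_secret() {
  awk -v name="$1" '$1 == name { found = 1 } END { exit (found ? 0 : 1) }'
}

# Possible usage against a live cluster (requires a logged-in oc session):
#   oc -n openshift-ptp get secrets --no-headers | has_secret linuxptp-daemon-secret \
#     || echo "linuxptp-daemon-secret is missing"
```

Against the listing in this report the helper would return non-zero for linuxptp-daemon-secret, since only ptp-operator-service-cert and webhook-server-cert are present.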
CI job: https://www.distributed-ci.io/jobs/50706b8c-4dc1-43a0-9374-1529ef1bd524/jobStates?sort=date
Must-gather log: must_gather.tar.gz