OCPBUGS-58199 (OpenShift Bugs)

linuxptp daemonset pods get stuck in ContainerCreating because the secret linuxptp-daemon-secret is missing


    • Quality / Stability / Reliability
    • Critical
    • 7/2: Fixed and merged in 4.20

      Description of problem:

      The linuxptp daemonset pods get stuck in ContainerCreating because the secret linuxptp-daemon-secret is missing.
          

      Version-Release number of selected component (if applicable):

      OCP: 4.20.0-ec.3 
      PTP Operator: 4.20.0-202506242215 (pre-ga)
          

      How reproducible:

      About 50%: of 4 daily CI jobs, 2 failed with this error and 2 passed.
      
          

      Steps to Reproduce:

          1. Deploy OCP 4.20.0-ec.3 on bare-metal nodes.
          2. Deploy PTP Operator 4.20.0-202506242215 (pre-GA).
          3. Check the daemonset and pods in the openshift-ptp namespace. About half of the time the pods remain in ContainerCreating because the secret linuxptp-daemon-secret is missing.
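
The check in step 3 could be scripted for CI along these lines. This is only a sketch: the function name count_stuck_pods is made up for illustration, and it assumes the default tabular output of "oc get pods" (NAME, READY, STATUS columns) as shown in this report.

```shell
#!/usr/bin/env bash
# count_stuck_pods (hypothetical helper): read the table printed by
# `oc get pods` on stdin and print how many linuxptp-daemon pods
# report STATUS ContainerCreating.
count_stuck_pods() {
  # NR > 1 skips the header row; $1 is NAME and $3 is STATUS.
  awk 'NR > 1 && $1 ~ /^linuxptp-daemon-/ && $3 == "ContainerCreating" { n++ }
       END { print n + 0 }'
}
```

It would be fed from the same command used in this report, for example: oc -n openshift-ptp get pods | count_stuck_pods. A non-zero count long after the operator was deployed would suggest the failure described here.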
          

      Actual results:

      About half of the time the linuxptp daemonset pods are not running and the secret "linuxptp-daemon-secret" is not created.
          

      Expected results:

      The linuxptp daemonset pods should be running 100% of the time and the secret "linuxptp-daemon-secret" should always be created.
          

      Additional info:

      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc get clusterversion
      NAME      VERSION       AVAILABLE   PROGRESSING   SINCE   STATUS
      version   4.20.0-ec.3   True        False         5h26m   Cluster version is 4.20.0-ec.3
      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc get nodes
      NAME       STATUS   ROLES                  AGE     VERSION
      master-0   Ready    control-plane,master   6h39m   v1.32.5
      master-1   Ready    control-plane,master   6h40m   v1.32.5
      master-2   Ready    control-plane,master   6h40m   v1.32.5
      worker-0   Ready    worker                 5h37m   v1.32.5
      worker-1   Ready    worker                 5h36m   v1.32.5
      worker-2   Ready    worker                 5h35m   v1.32.5
      worker-3   Ready    worker                 5h36m   v1.32.5
      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get daemonset
      NAME              DESIRED   CURRENT   READY   UP-TO-DATE   AVAILABLE   NODE SELECTOR            AGE
      linuxptp-daemon   7         7         0       7            0           kubernetes.io/os=linux   4h30m
      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get pods
      NAME                            READY   STATUS              RESTARTS        AGE
      linuxptp-daemon-45fcb           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-gd6ss           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-gsp8z           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-hk68z           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-ph9lb           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-ww6zk           0/2     ContainerCreating   0               4h30m
      linuxptp-daemon-xkv5t           0/2     ContainerCreating   0               4h30m
      ptp-operator-6d6d47dbd6-v5qph   1/1     Running             1 (4h27m ago)   4h30m
      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp describe pod linuxptp-daemon-45fcb | tail -4
      Events:
        Type     Reason       Age                      From     Message
        ----     ------       ----                     ----     -------
        Warning  FailedMount  3m43s (x139 over 4h30m)  kubelet  MountVolume.SetUp failed for volume "linuxptp-certs" : secret "linuxptp-daemon-secret" not found
      
      [kni@provisioner.cluster5.dfwt5g.lab ~]$ oc -n openshift-ptp get secrets
      NAME                        TYPE                DATA   AGE
      ptp-operator-service-cert   kubernetes.io/tls   3      4h30m
      webhook-server-cert         kubernetes.io/tls   2      4h30m
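
To confirm which secret the kubelet is waiting for without reading the events by hand, the FailedMount message can be parsed. The sketch below assumes the event text has the form shown above (secret "NAME" not found); the function name missing_secrets is hypothetical.

```shell
#!/usr/bin/env bash
# missing_secrets (hypothetical helper): read `oc describe pod` output on
# stdin and print the unique secret names reported as not found in
# FailedMount events.
missing_secrets() {
  # Only look at FailedMount lines; capture the quoted name that follows `secret`.
  sed -n '/FailedMount/s/.*secret "\([^"]*\)" not found.*/\1/p' | sort -u
}
```

For example: oc -n openshift-ptp describe pod linuxptp-daemon-45fcb | missing_secrets. The printed name can then be compared against the output of oc -n openshift-ptp get secrets.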
      
          

      CI job: https://www.distributed-ci.io/jobs/50706b8c-4dc1-43a0-9374-1529ef1bd524/jobStates?sort=date
      Must-gather log: must_gather.tar.gz

              aputtur@redhat.com Aneesh Puttur
              rhn-gps-manrodri Manuel Rodriguez
              Yang Liu Yang Liu