Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-74985

ptp operator 4.18 unstable - linuxptp-daemon showing pmc errors

    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • Critical
    • Yes
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      ptp-operator.v4.18.0-202601292038

      Basic ptp tests failed during regression test.  Inspection of linuxptp-daemon shows repeated pmc errors:

      
      [kni@registry.kni-qe-85 ~]$ oc logs -n openshift-ptp ds/linuxptp-daemon linuxptp-daemon-container | grep -i err
      E0203 09:47:24.121170    6813 daemon.go:987] Write phc2sys[5729.202]: [phc2sys.4.config:6] CLOCK_REALTIME phc offset        -8 s2 freq  -19004 delay    628 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:47:24.124809    6813 daemon.go:987] Write ptp4l[5735.563]: [ptp4l.1.config:6] master offset          0 s2 freq      -1 path delay       221 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:47:24.124814    6813 daemon.go:987] Write ptp4l[5735.500]: [ptp4l.0.config:6] master offset          2 s2 freq      +5 path delay       206 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:47:24.124882    6813 daemon.go:987] Write ptp4l[5736.703]: [ptp4l.2.config:6] master offset         -2 s2 freq   -7766 path delay       542 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:47:24.125026    6813 daemon.go:987] Write ptp4l[5736.463]: [ptp4l.3.config:6] master offset         -2 s2 freq   -7765 path delay       537 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:48:29.757587    6813 pmc.go:53] pmc result match error expect: timer expired after 2 seconds
      E0203 09:48:29.758586    6813 daemon.go:860] error parsing PMC util for clock class change event expect: timer expired after 2 seconds
      E0203 09:49:27.755639    6813 daemon.go:987] Write phc2sys[5850.350]: [phc2sys.4.config:6] CLOCK_REALTIME phc offset        -2 s2 freq  -18997 delay    720 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:49:27.755742    6813 daemon.go:987] Write ptp4l[5855.526]: [ptp4l.0.config:6] master offset         -1 s2 freq      -4 path delay       223 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:49:27.755872    6813 daemon.go:987] Write ptp4l[5804.472]: [ptp4l.1.config:6] master offset          0 s2 freq      -0 path delay       207 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:49:27.755782    6813 daemon.go:987] Write ptp4l[5856.491]: [ptp4l.3.config:6] master offset          5 s2 freq   -7765 path delay       537 error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:49:27.961178    6813 daemon.go:987] Write ptp4l[5915.314]: [ptp4l.2.config:5] selected best master clock 001747.fffe.70181a error write unix @->/cloud-native/events.sock: write: broken pipe:
      E0203 09:51:27.762505    6813 daemon.go:987] Write ptp4l[5977.536]: [ptp4l.2.config:6] master offset          0 s2 freq   -7778 path delay       542 error write unix @->/cloud-native/events.sock: write: broken pipe:
          

      Metrics show numerous ptp process restarts:

      # HELP openshift_ptp_process_restart_count 
      # TYPE openshift_ptp_process_restart_count counter
      openshift_ptp_process_restart_count{config="phc2sys.4.config",node="helix76.telcoqe.eng.rdu2.dc.redhat.com",process="phc2sys"} 66
      openshift_ptp_process_restart_count{config="ptp4l.0.config",node="helix76.telcoqe.eng.rdu2.dc.redhat.com",process="ptp4l"} 67
      openshift_ptp_process_restart_count{config="ptp4l.1.config",node="helix76.telcoqe.eng.rdu2.dc.redhat.com",process="ptp4l"} 67
      openshift_ptp_process_restart_count{config="ptp4l.2.config",node="helix76.telcoqe.eng.rdu2.dc.redhat.com",process="ptp4l"} 65
      openshift_ptp_process_restart_count{config="ptp4l.3.config",node="helix76.telcoqe.eng.rdu2.dc.redhat.com",process="ptp4l"} 67
      

      Version-Release number of selected component (if applicable):

      ptp-operator.v4.18.0-202601292038    

      How reproducible:

      100%    

      Steps to Reproduce:

          1. Deploy ptp-operator.v4.18.0-202601292038
          2. Watch linuxptp-daemon log
          

      Actual results:

          

      Expected results:

          

      Additional info:

          

              josricha@redhat.com Joseph Richard
              bblock@redhat.com Bonnie Block
              None
              None
              Bonnie Block Bonnie Block
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: