OpenShift Bugs / OCPBUGS-77162

[4.16] Role metrics are missing after restarting sidecar container in linuxptp-daemon

    • Quality / Stability / Reliability
    • Moderate
    • In Progress
    • Release Note Not Required

      Description of problem:

      The openshift_ptp_interface_role gauge is missing from the metrics after the sidecar container (cloud-event-proxy) is restarted.
      Automated and manual test steps show the same result.

      Version-Release number of selected component (if applicable):

      ptp-operator.v4.16.0-202602140116
      Cluster version is 4.16.57 

      How reproducible:

      100%    

      Steps to Reproduce:

      1.  Deploy a 4.16 SNO cluster with PTP.
      
      2.  Examine metrics:
      oc exec -it ds/linuxptp-daemon  -n openshift-ptp -c linuxptp-daemon-container -- curl -s localhost:9091/metrics
      
      The metrics should include a section for the openshift_ptp_interface_role gauge:
      # TYPE openshift_ptp_interface_role gauge
      openshift_ptp_interface_role{iface="eno12399",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 1
      openshift_ptp_interface_role{iface="ens3f1np1",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 1
      openshift_ptp_interface_role{iface="ens5f0",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 2
      openshift_ptp_interface_role{iface="ens5f1",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 2
      openshift_ptp_interface_role{iface="ens5f2",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 2
      openshift_ptp_interface_role{iface="ens5f3",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 2
      openshift_ptp_interface_role{iface="ens7f3",node="helix60.lab.eng.rdu2.redhat.com",process="ptp4l"} 2
      
      3.  Restart the sidecar:
      oc exec ds/linuxptp-daemon -c cloud-event-proxy -n openshift-ptp -- sh -c "kill -9 \$(ps aux | grep cloud-event-proxy | grep -v grep | awk '{print \$2}')"
      
      4.  Wait for the sidecar to recover, then check the metrics for the openshift_ptp_interface_role gauge:
      oc exec -it ds/linuxptp-daemon  -n openshift-ptp -c linuxptp-daemon-container -- curl -s localhost:9091/metrics | grep -i role
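
To make the check in steps 2 and 4 easier to repeat, the metrics output can be reduced to the list of interfaces that currently report a role. The following is only a sketch: role_series and role_ifaces are hypothetical helper names, not part of the PTP operator, and they assume the endpoint returns the Prometheus text format shown in step 2.

```shell
#!/bin/sh
# Hypothetical helpers (not part of the PTP operator). They only parse
# Prometheus text-format metrics supplied on stdin.

# Print the sample lines of openshift_ptp_interface_role,
# skipping the "# HELP" / "# TYPE" comment lines.
role_series() {
    grep '^openshift_ptp_interface_role{' || true
}

# Print the sorted, de-duplicated iface label values that have a role sample.
role_ifaces() {
    role_series | sed -n 's/.*iface="\([^"]*\)".*/\1/p' | sort -u
}

# Example use (assumes the same daemonset and container names as the steps above):
#   oc exec ds/linuxptp-daemon -n openshift-ptp -c linuxptp-daemon-container -- \
#       curl -s localhost:9091/metrics | role_ifaces
```

An empty result after the sidecar restart corresponds to the failure described in this report.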

      Actual results:

      No openshift_ptp_interface_role gauge is shown in metrics.

      Expected results:

      The openshift_ptp_interface_role gauge shows the role for all interfaces, as it did before the restart.
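
One way to check the expected result is to save the metrics output before and after the restart and list the interfaces that lost their role sample. This is a minimal sketch under the assumption that both dumps were written to files; missing_role_ifaces and ifaces_in are hypothetical names introduced here for illustration.

```shell
#!/bin/sh
# Hypothetical helpers: both arguments of missing_role_ifaces are files
# holding the raw output of the /metrics endpoint.

# Print the sorted iface label values that have a role sample in file $1.
ifaces_in() {
    sed -n 's/^openshift_ptp_interface_role{.*iface="\([^"]*\)".*/\1/p' "$1" | sort -u
}

# Print interfaces that had a role sample in $1 (before the restart)
# but have none in $2 (after the restart).
missing_role_ifaces() {
    before=$(mktemp) && after=$(mktemp) || return 2
    ifaces_in "$1" > "$before"
    ifaces_in "$2" > "$after"
    # comm -23 keeps lines that appear only in the first sorted input.
    comm -23 "$before" "$after"
    rm -f "$before" "$after"
}
```

Empty output means every interface that reported a role before the restart still reports one afterwards. With the behavior described under "Actual results", all interfaces would be listed.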

      Additional info:

      Related to OCPBUGS-59269; the fix was backported to 4.18.

              micosta@redhat.com Michele Tomaso Costa
              henshay Hen Shay Hassid
              Hen Shay Hassid Hen Shay Hassid