Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Major
Fix Version/s: None
Affects Version/s: 4.14
Component/s: Networking / ptp
Labels:
- VLAN
- interfaces
- metrics
- prometheus
- ptp

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Important
Regression:
None
Latest Status Summary:
7/03: 4.14 code merged , but not verified

Target Backport Versions:
None
Target Version:

4.14.z
Release Blocker:
None
Sprint:
None

RH Private Keywords:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
In Progress
Release Note Type:
Bug Fix
Release Note Text:

Hide
*Cause*: The summary metrics were not being masked correctly.
*Consequence*: When a port went into a faulty state, the part of the code responsible for setting the fault offset correctly masked the name. However, because the initial summary metrics weren't masked, a new interface appeared with a value that remained unchanged, effectively displaying incorrect/stale data.
*Fix*: The masking/aliasing of the summary metrics was implemented.
*Result*: Bug doesn’t present anymore.

Show
*Cause*: The summary metrics were not being masked correctly. *Consequence*: When a port went into a faulty state, the part of the code responsible for setting the fault offset correctly masked the name. However, because the initial summary metrics weren't masked, a new interface appeared with a value that remained unchanged, effectively displaying incorrect/stale data. *Fix*: The masking/aliasing of the summary metrics was implemented. *Result*: Bug doesn’t present anymore.

Escape Reason:
Escape Impact:
Corrective Measures:
SDLC stage when should've been found:

Description of problem:

RH PTP pod linux-ptp-daemon-xx pod is exporting metrics for non-existing interfaces when kept on running for more than 2-3 hours

Version-Release number of selected component (if applicable):

OCP v4.14

How reproducible:

Happening on customer environment.

Steps to Reproduce:

1. Apply PtpConfig mentioning VLAN interfaces e.g. ens8f0np0.20, ens9f1np1.400
2. Next, wait for few hours & watch Prometheus/Grafan logs
3. See the non-existing interfaces being reported as 999999 ns in Grafana.

Actual results:

Unknown interface are seen with high openshift_ptp_offset_ns value

Expected results:

To only see the interfaces mentioned in PtpConfig

Additional info:

As a workaround, I had suggested them to create a node-level Prometheus filter rule & that had helped them get the correct metrics as desired.
openshift_ptp_offset_ns{interface=~"ens9f1np1.400|ens8f0np0.20"}

This confirms the that incorrect metrics are also being exported by ptp4l for some unknown interfaces.

links to

openshift/linuxptp-daemon#457: [release-4.14] OCPBUGS-55309: Fix interface name for summary metrics

redhat-cne/cloud-event-proxy#550: [release-4.14] OCPBUGS-55309: Replaces all aliasing lines with a common function

RHBA-2025:11669 OpenShift Container Platform 4.14.54 bug fix update

Assignee:: Michele Tomaso Costa

Reporter:: Akash Dubey

QA Contact:: Bonnie Block

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: 2025/04/24 7:58 AM

Updated:: 2025/10/26 12:35 AM

Resolved:: 2025/07/31 3:57 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates