Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: 4.22.0
Affects Version/s: 4.19
Component/s: HyperShift
Labels:
None

Activity Type:
None
Blocked:
False
Blocked Reason:
None
Story Points:
None
Severity:
None
Regression:
None

Target Backport Versions:
None
Target Version:
4.22.0
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

An alert reports the Node Tuning Operator as down for all HCP (hosted control plane) clusters.

Version-Release number of selected component (if applicable):

4.19

How reproducible:

Always

Additional Details:

The hub cluster reports the Node Tuning Operator as down for every hosted cluster. However, the operator is up and working correctly in the hosted clusters.

$ oc get servicemonitor/node-tuning-operator -n clusters-test -ojson | jq .spec.selector
{
  "matchLabels": {
    "hypershift.openshift.io/control-plane-component": "cluster-node-tuning-operator",
    "name": "node-tuning-operator"
  }
}

$ oc get svc -l hypershift.openshift.io/control-plane-component=cluster-node-tuning-operator -n clusters-test
NAME                   TYPE        CLUSTER-IP   EXTERNAL-IP   PORT(S)     AGE
node-tuning-operator   ClusterIP   None         <none>        60000/TCP   164m 

$ oc get svc -l hypershift.openshift.io/control-plane-component=cluster-node-tuning-operator -n clusters-test
NAME                   TYPE        CLUSTER-IP   EXTERNAL-IP   PORT(S)     AGE
node-tuning-operator   ClusterIP   None         <none>        60000/TCP   19h

$ oc -n clusters-test rsh cluster-node-tuning-operator-867f8dcc55-9lv6g

sh-5.1$ curl localhost:60000
curl: (7) Failed to connect to localhost port 60000: Connection refused

sh-5.1$ ss -tunlp 
Netid              State               Recv-Q              Send-Q                           Local Address:Port                           Peer Address:Port             Process             
tcp                LISTEN              0                   0                                            *:8080                                      *:*                 users:(("cluster-node-tu",pid=1,fd=6))

In short, the metrics endpoint is served on port 8080 (:8080/metrics), not on port 60000, which is the port the Service exposes.
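
To repeat the port check above without reading `ss` output by eye, a small helper along the following lines may be useful. This is only a sketch: `port_is_listening` is a hypothetical name, and it assumes the `ss` output has the same column layout as the sample above (Netid first, local address in the fifth column).

```shell
#!/bin/sh
# port_is_listening PORT
# Hypothetical helper: reads `ss -tunl`-style output on stdin and exits 0
# only if a socket in state LISTEN has a local address ending in :PORT.
# Assumes the column layout shown in this report (Netid, State, Recv-Q,
# Send-Q, Local Address:Port, ...).
port_is_listening() {
  awk -v port="$1" '
    $2 == "LISTEN" {
      # Split on ":" and keep the last piece, so "*:8080" and "[::]:8080"
      # both yield "8080".
      n = split($5, parts, ":")
      if (parts[n] == port) found = 1
    }
    END { exit !found }
  '
}
```

For example, with the pod from this report, `oc -n clusters-test exec cluster-node-tuning-operator-867f8dcc55-9lv6g -- ss -tunl | port_is_listening 60000 || echo "nothing listens on 60000"` would print the message if the port is still unserved.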

My assumption is that the fix should go in the Service YAML, with the ServiceMonitor checked alongside it:

https://github.com/openshift/hypershift/blob/main/control-plane-operator/controllers/hostedcontrolplane/v2/assets/cluster-node-tuning-operator/service.yaml#L10-L12

https://github.com/openshift/hypershift/blob/main/control-plane-operator/controllers/hostedcontrolplane/v2/assets/cluster-node-tuning-operator/servicemonitor.yaml#L14

links to

Node Tuning Operator Target Down in Hosted Control Plane clusters

openshift/hypershift#7468: OCPBUGS-72596: Fix node-tuning-operator metrics port configuration

Assignee: Juan Manuel Parrilla Madrid

Reporter: Himank Chaturvedi

QA Contact: Jim Ma

Need Info From: None

Votes: 0

Watchers: 7

Created: 2026/01/12 9:59 AM

Updated: 2026/02/17 5:53 PM
