-
Bug
-
Resolution: Done-Errata
-
Normal
-
4.18.z
-
None
-
Important
-
None
-
False
-
-
N/A
-
Release Note Not Required
-
Done
Description of problem:
In ARO HCP, scraping of the hosted control plane ServiceMonitor monitor-ovn-control-plane-metrics fails with the following error:
Get "https://10.128.64.200:9108/metrics": tls: failed to verify certificate: x509: certificate is valid for ovnkube-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc, ovnkube-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc.cluster.local, ovnkube-control-plane, localhost, not ovn-kubernetes-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc
The TLS connection cannot be established because the certificate's SANs do not contain the DNS name of the service endpoint being scraped (the certificate covers ovnkube-control-plane, while the scrape targets ovn-kubernetes-control-plane). Either the certificate is incorrect or the service endpoint is incorrect.
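The mismatch can be illustrated with a minimal sketch of DNS SAN matching, using the names from the error above. The helper san_matches and the constants are hypothetical names introduced only for illustration; this is a simplified approximation of hostname verification (exact match plus a single-label leftmost wildcard), not the actual verification code used by Prometheus or Go's crypto/x509.

```python
def san_matches(hostname: str, sans: list[str]) -> bool:
    """Return True if hostname matches any DNS SAN.

    Simplified rules (an assumption of this sketch): case-insensitive exact
    match, or a "*." wildcard that covers exactly one leftmost label.
    """
    host = hostname.lower().rstrip(".")
    for san in sans:
        pattern = san.lower().rstrip(".")
        if pattern == host:
            return True
        if pattern.startswith("*."):
            # Drop the leftmost label of the host and compare the remainder.
            _, _, rest = host.partition(".")
            if rest and rest == pattern[2:]:
                return True
    return False


# Values copied from the error message in this report.
NAMESPACE = "ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid"
CERT_SANS = [
    f"ovnkube-control-plane.{NAMESPACE}.svc",
    f"ovnkube-control-plane.{NAMESPACE}.svc.cluster.local",
    "ovnkube-control-plane",
    "localhost",
]
# Server name the scrape verifies against, per the error message.
SCRAPE_NAME = f"ovn-kubernetes-control-plane.{NAMESPACE}.svc"

# The certificate was issued for "ovnkube-control-plane", so the
# "ovn-kubernetes-control-plane" name finds no matching SAN.
print(san_matches(SCRAPE_NAME, CERT_SANS))
```

Under these assumptions the check fails for the scraped name and succeeds for ovnkube-control-plane.<namespace>.svc, which is consistent with the two possible fixes named above: add the scraped name to the certificate, or point the ServiceMonitor at a name the certificate already covers.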
Version-Release number of selected component (if applicable):
Not sure about the HyperShift release version; we are using the image arohcpsvcdev.azurecr.io/acm-d/rhtap-hypershift-operator@sha256:305f45bf036f84255d41c20517c70a9cb18af3dcdfa71a82a5716dde77c9e2c3
The hosted cluster version is 4.18.1
How reproducible:
Every time
Steps to Reproduce:
1. Create ARO-HCP svc/mgmt cluster
2. Deploy OSS Prometheus and configure it to scrape hosted control plane metrics
3. Observe the error in the Prometheus UI under "Targets"
Actual results:
monitor-ovn-control-plane-metrics scraping fails with a TLS certificate verification error.
Expected results:
monitor-ovn-control-plane-metrics scraping succeeds.
Additional info:
- is related to: OCPBUGS-54912 node-tuning-operator servicemonitor doesn't seem to be working (New)
- links to: RHEA-2024:11038 OpenShift Container Platform 4.19.z bug fix update