Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-63034

ARO-HCP: Prometheus metrics scraping failing for ovnkube-control-plane

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • 4.19.0
    • 4.18.z
    • HyperShift / ARO
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Important
    • None
    • None
    • None
    • None
    • In Progress
    • Release Note Not Required
    • N/A
    • None
    • None
    • None
    • None

      This is a clone of issue OCPBUGS-54533. The following is the description of the original issue:

      Description of problem:

      In ARO HCP, the hosted control plane servicemonitor monitor-ovn-control-plane-metrics scraping fails with the error:
      
      Get "https://10.128.64.200:9108/metrics": tls: failed to verify certificate: x509: certificate is valid for ovnkube-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc, ovnkube-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc.cluster.local, ovnkube-control-plane, localhost, not ovn-kubernetes-control-plane.ocm-arohcpdev-2hsuivk8tpt8oen8sav68o8s6vpucm38-tschneid.svc
      
      This is because the TLS cannot be established since the certificate SANs doesn't contain the dns entry for the service endpoint.  Either the certificate is incorrect or the service endpoint is incorrect.
      

       

      Version-Release number of selected component (if applicable):

      Not sure about hypershifts version release, we are using image arohcpsvcdev.azurecr.io/acm-d/rhtap-hypershift-operator@sha256:305f45bf036f84255d41c20517c70a9cb18af3dcdfa71a82a5716dde77c9e2c3
      The hosted cluster version is 4.18.1      

      How reproducible:

      Every time

      Steps to Reproduce:

          1. Create ARO-HCP svc/mgmt cluster
          2. Deploy OSS Prometheus and configure it to scrape hosted control plane metrics
          3. Observe the error in the Pometheus UI under "Targets"
          

      Actual results:

      monitor-ovn-control-plane-metrics scraping fails with tls error

      Expected results:

      monitor-ovn-control-plane-metrics scraping succeeds.

      Additional info:

       

              rh-ee-bclement Borja Clemente Castanera
              rh-ee-tschneid Tony Schneider
              None
              None
              He Liu He Liu
              None
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated: