Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-69447

Enable CVO access to OBO Prometheus for ROSA HCP with RHOBS monitoring

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Critical Critical
    • None
    • 4.22.0
    • HyperShift
    • None
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • Important
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      This work is part of OCM-10395, which aims to enable the Cluster Version Operator to access metrics for conditional update risk evaluation in ROSA HCP clusters using RHOBS monitoring. 
      
      In ROSA HCP with RHOBS monitoring, the Prometheus endpoint is the OBO Prometheus instance, not the management cluster's default Prometheus (CoreOS). Currently, when operators install HyperShift with the --rhobs-monitoring flag, CVO cannot access the OBO Prometheus endpoint for metrics-based conditional update risk evaluation. 
      
      We have validated in int-blue that the network policy and URL are correct to allow CVO connect to OBO, see details in https://issues.redhat.com/browse/OCM-19543
      
      Backporting:
      We need backport to 4.21.z and 4.20.z. 

      Version-Release number of selected component (if applicable):

          

      How reproducible:

      100% - All ROSA HCP clusters using RHOBS monitoring on 4.21.z and 4.20.z currently lack CVO metrics access

      Steps to Reproduce:

      1. Deploy HyperShift operator with --rhobs-monitoring flag enabled
      2. Create a ROSA HCP hosted cluster
      3. Examine the CVO deployment in the hosted control plane namespace
      4. Check if CVO pod has RBAC permissions to access metrics endpoint
      5. Verify network policies allow egress to OBO Prometheus     

      Actual results:

      1. CVO deployment does NOT have metrics access enabled
      2. CVO pod lacks RBAC permissions for metrics endpoint access
      3. Network policies do NOT allow egress to OBO Prometheus

      Expected results:

      1. CVO deployment should have metrics access automatically enabled when RHOBS monitoring is active
      2. Cluster should perform conditional update risk assessment at cluster level using actual cluster metrics

      Additional info:

      Related JIRA: 1. https://issues.redhat.com/browse/ROSA-377 2. https://issues.redhat.com/browse/OCM-10395

              Unassigned Unassigned
              rh-ee-chlu Chee Lu
              None
              None
              He Liu He Liu
              None
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated: