Uploaded image for project: 'OpenShift Hosted Control Plane'
  1. OpenShift Hosted Control Plane
  2. HOSTEDCP-975

Review NodePool metrics and set some internal SLOs/SLIs

    XMLWordPrintable

Details

    • Story
    • Resolution: Done
    • Undefined
    • None
    • None
    • None
    • None
    • False
    • None
    • False
    • Hypershift Sprint 236, Hypershift Sprint 237
    • 0
    • 0
    • 0

    Description

      Follow up for https://issues.redhat.com/browse/HOSTEDCP-969

      Create metrics and grafana panel in

      https://hypershift-monitoring.homelab.sjennings.me:3000/d/PGCTmCL4z/hypershift-slos-slis-alberto-playground?orgId=1&from=now-24h&to=now

      https://github.com/openshift/hypershift/tree/main/contrib/metrics

      for NodePool internal SLOs/SLIs:

      • NodePoolDeletionDuration
      • NodePoolInitialRolloutDuration

      Move existing metrics when possible from metrics loop into nodepool controller:

      - nodePoolSize

      Explore and discuss granular metrics to track NodePool lifecycle bottle necks, infra, ignition, node networking, available. Consolidate that with hostedClusterTransitionSeconds metrics and dashboard panels

      Explore and discuss metrics for upgrade duration SLO for both HC and NodePool.

      Attachments

        Activity

          People

            rh-ee-mraee Mulham Raee
            agarcial@redhat.com Alberto Garcia Lamela
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: