Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-5062

Phase 1: Test a single dashboard for bottlenecks

XMLWordPrintable

    • Observability Sprint 2023-06
    • No

      Value Statement

      1. Exclusively focus on ACM - Resource Optimization  dashboard
        1. identify all metrics and labels that belong to the dashboard
          1. https://docs.google.com/document/d/1NphXns55SPJoJEoDed2a1YkhSIide1OWhQoWbLZeWNI/edit?usp=sharing
        2. Assume 1 namespace to 4000 7800 pods to mimic UPS env 
        3. generate 6 months of S3 thanosbench data
        4. test dashboard performance and identify find bottlenecks
          1. https://docs.google.com/document/d/1H2SI215PARNWXWAEV98XimYp2VeMyxCtxvKw8i9vb3E/edit?usp=sharing
      2. Ensure the following performance optimizations in place,
        1. increase memcache settings
        2. increased HA proxy time-outs
        3. optimized queries for enumerating managed clusters
        4. disable store PV usage altogether via Thanos 0.31 config options
      3. Definition of Done for Engineering Story Owner (Checklist)
      • ...

      Development Complete

      • The code is complete.
      • Functionality is working.
      • Any required downstream Docker file changes are made.

      Tests Automated

      • [ ] Unit/function tests have been automated and incorporated into the
        build.
      • [ ] 100% automated unit/function test coverage for new or changed APIs.

      Secure Design

      • [ ] Security has been assessed and incorporated into your threat model.

      Multidisciplinary Teams Readiness

      Support Readiness

      • [ ] The must-gather script has been updated.

              rh-ee-ngraham Nathaniel Graham
              smeduri1@redhat.com Subbarao Meduri
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: