Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-27615

[itn-2025-00330] Fix GC logic in OSDFM side

XMLWordPrintable

    • Icon: Epic Epic
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • None
    • Fleet Manager
    • None
    • [itn-2025-00330] Fix GC logic in OSDFM side
    • Quality / Stability / Reliability
    • 5
    • False
    • Hide

      None

      Show
      None
    • False
    • Not Selected
    • To Do
    • 86% To Do, 0% In Progress, 14% Done
    • OSDFM Sprint 3, OSDFM Sprint 4, OSDFM Sprint 5
    • Important

      OCP/Telco Definition of Done
      https://docs.google.com/document/d/1TP2Av7zHXz4_fmeX4q9HB0m9cqSZ4F6Jd4AiVoaF_2s/edit#heading=h.gaa58bzbvwde
      Epic Template descriptions and documentation.
      https://docs.google.com/document/d/14CUCEg6hQ_jpsFzJtWo29GfFVWmun2Uivrxq3_Fkgdg/edit
      ACM-wide Product Requirements (Top-level Epics)
      https://docs.google.com/document/d/1uIp6nS2QZ766UFuZBaC9USs8dW_I5wVdtYF9sUObYKg/edit

      *<--- Cut-n-Paste the entire contents of this description into your new
      Epic --->*

      Epic Goal

      This epic will fix the current logic of OSDFM Garbage collector component (GC) which is responsible for deleting unused MCs for cost saving.

      The issue in GC was identified during the incident  [itn-2025-00330]. The incident made the ROSA HCPs provision in ap-southeast-1 be unavailable since a MC was not deleted properly and it was still available in the placement decision. 

       

      We must:

      • Fix the current GC logic to not skip any cleanup step. Also, make sure it is stable and could not affect the ROSA HCP service availability.
      • Enable the GC in OSDFM Commercial and FEDRAMP for cost saving.

      Why is this important?

      • Cover a technical debt identified in OSDFM GC logic.
      • Cost saving deleting unused SCs and MCs.

      Scenarios

      ...

      Acceptance Criteria

      • Fix the GC logic and test in development environments.
      • Reenable the GC logic in production commercial and FEDRAMP.

      Dependencies (internal and external)

      1. ...

      Previous Work (Optional):

      1. ...

      Open questions:

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub
        Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub
        Issue>
      • DEV - Downstream build attached to advisory: <link to errata>
      • QE - Test plans in Polarion: <link or reference to Polarion>
      • QE - Automated tests merged: <link or reference to automated tests>
      • DOC - Doc issue opened with a completed template. Separate doc issue
        opened for any deprecation, removal, or any current known
        issue/troubleshooting removal from the doc, if applicable.
      • Considerations were made for Extended Update Support (EUS)

              rh-ee-anrocha Andre Rocha
              rh-ee-anrocha Andre Rocha
              Andre Rocha, Chee Lu, Chunxi Luo
              Anna Francis Anna Francis
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated: