-
Epic
-
Resolution: Unresolved
-
Major
-
None
-
None
-
None
-
[itn-2025-00330] Fix GC logic in OSDFM side
-
Quality / Stability / Reliability
-
5
-
False
-
-
False
-
Not Selected
-
To Do
-
86% To Do, 0% In Progress, 14% Done
-
OSDFM Sprint 3, OSDFM Sprint 4, OSDFM Sprint 5
-
Important
OCP/Telco Definition of Done
https://docs.google.com/document/d/1TP2Av7zHXz4_fmeX4q9HB0m9cqSZ4F6Jd4AiVoaF_2s/edit#heading=h.gaa58bzbvwde
Epic Template descriptions and documentation.
https://docs.google.com/document/d/14CUCEg6hQ_jpsFzJtWo29GfFVWmun2Uivrxq3_Fkgdg/edit
ACM-wide Product Requirements (Top-level Epics)
https://docs.google.com/document/d/1uIp6nS2QZ766UFuZBaC9USs8dW_I5wVdtYF9sUObYKg/edit
*<--- Cut-n-Paste the entire contents of this description into your new
Epic --->*
Epic Goal
This epic will fix the current logic of OSDFM Garbage collector component (GC) which is responsible for deleting unused MCs for cost saving.
The issue in GC was identified during the incident [itn-2025-00330]. The incident made the ROSA HCPs provision in ap-southeast-1 be unavailable since a MC was not deleted properly and it was still available in the placement decision.
We must:
- Fix the current GC logic to not skip any cleanup step. Also, make sure it is stable and could not affect the ROSA HCP service availability.
- Enable the GC in OSDFM Commercial and FEDRAMP for cost saving.
Why is this important?
- Cover a technical debt identified in OSDFM GC logic.
- Cost saving deleting unused SCs and MCs.
Scenarios
...
Acceptance Criteria
- Fix the GC logic and test in development environments.
- Reenable the GC logic in production commercial and FEDRAMP.
Dependencies (internal and external)
- ...
Previous Work (Optional):
- ...
Open questions:
- …
Done Checklist
- CI - CI is running, tests are automated and merged.
- Release Enablement <link to Feature Enablement Presentation>
- DEV - Upstream code and tests merged: <link to meaningful PR or GitHub
Issue> - DEV - Upstream documentation merged: <link to meaningful PR or GitHub
Issue> - DEV - Downstream build attached to advisory: <link to errata>
- QE - Test plans in Polarion: <link or reference to Polarion>
- QE - Automated tests merged: <link or reference to automated tests>
- DOC - Doc issue opened with a completed template. Separate doc issue
opened for any deprecation, removal, or any current known
issue/troubleshooting removal from the doc, if applicable. - Considerations were made for Extended Update Support (EUS)