-
Story
-
Resolution: Done
-
Critical
-
None
-
Quality / Stability / Reliability
-
5
-
False
-
-
False
-
-
-
-
GH Train-30
-
Critical
-
None
Value Statement
Ensure the issue title clearly reflects the value of this user story to the
intended persona. (Explain the "WHY")
As a system administrator,
I want to improve the migration workflow to ensure that the Failed state is only set after cleanup is complete, streamline the pending status based on the final state, and enhance error handling and testing for both Manager and Agent,
so that the migration process is more reliable, easier to monitor, and thoroughly tested for robustness across the global hub end-to-end codebase.
Definition of Done for Engineering Story Owner (Checklist)
1. Migration Failure as a final state:
2. Pending Status Simplification:
-
- The pending status must accurately reflect the Failed state for clear monitoring.
3. Error Handling:
-
- Error messages from the Agent must be structured to be captured by the Manager for effective debugging.
4. Testing:
-
- Comprehensive unit and integration tests are implemented for the Manager Controller workflow.
-
- Robust unit and integration tests are added for the Agent roadmap in both source and target hubs.
- KlusterConfig Interval:
-
- The KlusterConfig interval is set to a reasonable value before cluster re-registration.
Subtasks
1. Refactor Migration Controller for Manager:
-
- Update the migration process to initiate cleanup before setting the phase to Failed.
-
- Ensure Failed and Completed are the final states.
-
- Simplify the pending status to align with the Failed state.
-
- Add unit and integration tests for the Manager Controller workflow.
2. Refactor Migration workflow for Agent:
-
- Configure the KlusterConfig interval to a reasonable value before cluster re-registration.
-
- Structure Agent error messages to be captured by the Manager.
-
- Implement robust unit and integration tests for the Agent roadmap in both source and target hubs.
Development Complete
- The code is complete.
- Functionality is working.
- Any required downstream Docker file changes are made.
Tests Automated
- [x] Unit/function tests have been automated and incorporated into the
build. [ ] 100% automated unit/function test coverage for new or changed APIs.
Secure Design
[ ] Security has been assessed and incorporated into your threat model.
Multidisciplinary Teams Readiness
[ ] Create an informative documentation issue using the Customer
Portal Doc template that you can access from [The Playbook](
and ensure doc acceptance criteria is met.
Call out this sentence as it's own action:
[ ] Link the development issue to the doc issue.
Support Readiness
[ ] The must-gather script has been updated.