Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-21410

Enhance Managed Cluster Migration Workflow and Testing for Manager and Agent

XMLWordPrintable

    • GH Train-30
    • Critical
    • None

      Value Statement

      Ensure the issue title clearly reflects the value of this user story to the
      intended persona. (Explain the "WHY")

      As a system administrator,
      I want to improve the migration workflow to ensure that the Failed state is only set after cleanup is complete, streamline the pending status based on the final state, and enhance error handling and testing for both Manager and Agent,
      so that the migration process is more reliable, easier to monitor, and thoroughly tested for robustness across the global hub end-to-end codebase.

      Definition of Done for Engineering Story Owner (Checklist)

      1. Migration Failure as a final state:

      2. Pending Status Simplification:

        • The pending status must accurately reflect the Failed state for clear monitoring.

      3. Error Handling:

        • Error messages from the Agent must be structured to be captured by the Manager for effective debugging.

      4. Testing:

        • Comprehensive unit and integration tests are implemented for the Manager Controller workflow.
        • Robust unit and integration tests are added for the Agent roadmap in both source and target hubs.
      1. KlusterConfig Interval:
        • The KlusterConfig interval is set to a reasonable value before cluster re-registration.

      Subtasks

      1. Refactor Migration Controller for Manager:

        • Update the migration process to initiate cleanup before setting the phase to Failed.
        • Ensure Failed and Completed are the final states.
        • Simplify the pending status to align with the Failed state.
        • Add unit and integration tests for the Manager Controller workflow.

      2. Refactor Migration workflow for Agent:

        • Configure the KlusterConfig interval to a reasonable value before cluster re-registration.
        • Structure Agent error messages to be captured by the Manager.
        • Implement robust unit and integration tests for the Agent roadmap in both source and target hubs.

       

      Development Complete

      • The code is complete.
      • Functionality is working.
      • Any required downstream Docker file changes are made.

      Tests Automated

      • [x] Unit/function tests have been automated and incorporated into the
        build.
      • [ ] 100% automated unit/function test coverage for new or changed APIs.

      Secure Design

      • [ ] Security has been assessed and incorporated into your threat model.

      Multidisciplinary Teams Readiness

      • [ ] Create an informative documentation issue using the Customer

      Portal Doc template that you can access from [The Playbook](

      https://docs.google.com/document/d/1YTqpZRH54Bnn4WJ2nZmjaCoiRtqmrc2w6DdQxe_yLZ8/edit#heading=h.9fvyr2rdriby),

      and ensure doc acceptance criteria is met.

      • Call out this sentence as it's own action:
      • [ ] Link the development issue to the doc issue.

      Support Readiness

      • [ ] The must-gather script has been updated.

          There are no Sub-Tasks for this issue.

              rh-ee-myan Meng Yan
              rh-ee-myan Meng Yan
              Yaheng Liu Yaheng Liu
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: