WRKLDS-937

Enable day2 add node using agent-install via oc

    • Type: Epic
    • Resolution: Done
    • Priority: Undefined
    • None
    • None
    • None
    • Day 2 Node Add Commands
    • BU Product Work
    • False
    • None
    • False
    • Yellow
    • In Progress
    • OCPSTRAT-1316 - [GA] Simplify and unify adding nodes to clusters on day 2
    • 0% To Do, 0% In Progress, 100% Done
      08/09/24 - Green

      • Support options based interface for the add-nodes-image command merged

      08/07/24 - Green

      • Guardrails to limit new command to OCP version 4.17 and higher merged
      • CI job to test adding nodes using oc adm node-image merged
      • Support for an options-based interface for the add-nodes-image command is still in progress.

      08/02/24 - Green

      • Monitor command merged
      • In-progress:
        • Adding guardrails to limit new command to OCP version 4.17 and higher
        • CI job to test adding nodes using oc adm node-image
        • Support for an options-based interface for the add-nodes-image command. This will very likely miss 4.17 because of review feedback, issues found in testing, and only one week left before feature freeze.

      07/23/24 - Green

      • Monitor command under review
      • Currently blocked from auth tasks: https://github.com/openshift/installer/pull/8717
      • Possibility to descope minor task WRKLDS-1317, to be reviewed next week

      07/16/24 - Green

      • Create command merged
      • Monitor command in progress

      07/12/24 - Yellow

      • Commands skeleton ready.
      • Create command implementation and tests ready, but both awaiting approval from the workload team.
      • Monitor command in progress
      • PRs with approval pending:
        • https://github.com/openshift/oc/pull/1808
        • https://github.com/openshift/oc/pull/1819
    • 33

      Epic Goal*

      Provide simple commands that let almost all users add a node to a cluster where scaling up a MachineSet isn't an option, whether they installed using UPI, Assisted, or the agent-based installer, or can't use MachineSets for some other reason.

       
      Why is this important? (mandatory)

      • Enable easy day2 installation without requiring additional knowledge from the user
      • Unified experience for day1 and day2 installation for the agent-based installer
      • Unified experience for day1 and day2 installation for the appliance workflow
      • Eliminate the requirement of installing MCE, which has high resource requirements (4 cores and 16GB RAM for a multi-node cluster, plus storage if the infrastructure operator is included)
      • Eliminate the requirement of nodes having a BMC available to expand bare metal clusters (see docs).
      • Simplify adding compute nodes based on the UPI method or other methods implemented in the field, such as WKLD-433 or other automations that try to solve this problem

       
      Scenarios (mandatory) 

      Provide details for user scenarios including actions to be performed, platform specifications, and user personas.  

      1. A user installed a day1 cluster with the agent-based installer and wants to add workers or replace failed nodes. Currently the alternatives are to install MCE or, if connected, use SaaS.

       
      Dependencies (internal and external) (mandatory)

      AGENT-682

      Contributing Teams(and contacts) (mandatory) 

      The installer team is developing the main body of the feature, which will run in the cluster to be expanded, as well as a prototype client-side script in AGENT-682. They will then be able to translate the client-side script into native oc adm subcommands.

      Acceptance Criteria (optional)

      Provide some (testable) examples of how we will know if we have achieved the epic goal.  

      Drawbacks or Risk (optional)

      Reasons we should consider NOT doing this such as: limited audience for the feature, feature will be superseded by other work that is planned, resulting feature will introduce substantial administrative complexity or user confusion, etc.

      Done - Checklist (mandatory)

      The following points apply to all epics and are what the OpenShift team believes are the minimum set of criteria that epics should meet for us to consider them potentially shippable. We request that epic owners modify this list to reflect the work to be completed in order to produce something that is potentially shippable.

      • CI Testing - Basic e2e automation tests are merged and completing successfully
      • Documentation - Content development is complete.
      • QE - Test scenarios are written and executed successfully.
      • Technical Enablement - Slides are complete (if requested by PLM)
      • Engineering Stories Merged
      • All associated work items with the Epic are closed
      • Epic status should be “Release Pending” 

              afasano@redhat.com Andrea Fasano
              zabitter Zane Bitter
              Votes: 3
              Watchers: 13