Uploaded image for project: 'OpenShift UX Product Design'
  1. OpenShift UX Product Design
  2. PD-1084

Explore: Add the ability to upgrade the control plane only (pause worker nodes) to the console


    • Icon: Story Story
    • Resolution: Done
    • Icon: Undefined Undefined
    • Admin
    • None
    • UXD Sprint 207, UXD Sprint 208

      Add the ability to choose between a full cluster upgrade or control plane upgrade (which will pause all worker pools) in the console.

      Currently in the console, users only have the ability to complete a full cluster upgrade. For many customers, upgrades take longer than what their maintenance window allows. Users need the ability to upgrade the control plane independently of the other worker nodes. 

      Ex. Upgrades of huge clusters may take too long so admins may do the control plane this weekend, worker-pool-A next weekend, worker-pool-B the weekend after, etc.  It is all at a pool level, they will not be able to choose specific hosts.

      What happens when the admin selects the control plane option and the control plane upgrade is complete. What happens to the worker and infra nodes? Do users have to take explicit action to upgrade those? Are there new alerts that we would need to account for?

      From Scott: Worker pools can remain paused as long as is normally safe, which means until certificate rotation becomes critical which is at about 60 days. The admin would be advised to unpause them in order to complete the full upgrade but there's a future where they may remain paused in order to "skip" part of the upgrade when upgrading across multiple minors. Components like the apisever will be responsible for setting conditions which inhibit upgrades that are unsafe in those scenarios using the existing Upgradeable=False condition which the console now displays starting in 4.9 I believe, I need to actually look at that UX.


      1. Add the ability to choose between a cluster upgrade and a control plane upgrade
      2. provide in context help information as to what the difference between those two choices are
      3. come up with a way a clear way to display that when the control upgrade is in place that the worker nodes are paused
      4. design the interaction for initiating the upgrades for each individual pool
      5. create an alert to let users know they are approaching the 60 day window and advise them to complete the rest of the upgrade 

      Additional Resources: 

            tle@redhat.com Thi Le (Inactive)
            mehall-1 Megan Hall
            0 Vote for this issue
            4 Start watching this issue