[OTA-1291] Correctly handle nodes that are members of multiple pools - Red Hat Issue Tracker

Type: Story
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- groomed

Work Type:
BU Product Work
Story Points:
3
Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
Improved presentation in oc adm upgrade status command
Intelligence Requested:
Market:

Sprint:
OTA 256, OTA 257

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Discovered by evakhoni@redhat.com during ~~OTA-1245~~, the 4.16 code did not take nodes that are members of multiple pools into account. This surfaced in several ways:

Duplicate insights (=we iterate over nodes over pools, so we see problematic edges in each pool it is a member of):

= Update Health =
SINCE   LEVEL     IMPACT           MESSAGE
-	Error     Update Stalled   Node ip-10-0-26-198.us-east-2.compute.internal is degraded
-	Error     Update Stalled   Node ip-10-0-26-198.us-east-2.compute.internal is degraded

Such node is present in all pool listings, and in some cases such as paused pools the output is confusing (paused-ness is a property of a pool, so we list a node as paused in one pool but outdated pending in another):

= Worker Pool =
Worker Pool:     mcpfoo
Assessment:      Excluded
...

Worker Pool Node
NAME                                        ASSESSMENT   PHASE    VERSION   EST   MESSAGE
ip-10-0-26-198.us-east-2.compute.internal   Excluded     Paused   4.15.12   -

= Worker Pool =
Worker Pool:     worker
...
Worker Pool Nodes
NAME                                        ASSESSMENT   PHASE     VERSION                              EST   MESSAGE 
ip-10-0-26-198.us-east-2.compute.internal   Outdated     Pending   4.15.12                              ?

It is not clear to me what would be the correct presentation of this case. Because this is an update status (and not node or cluster status) command, and only a single pool drives an update of a node, I'm thinking that maybe the best course of action would be to show only nodes whose version is driven by a given pool, or maybe come up with a "externally driven"-like assessment or whatever.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

Screenshot 2024-09-03 at 9.38.49 AM.jpg
157 kB
2024/09/03 1:47 PM

incorporates

OCPBUGS-37169 worker pool status should reflect the correct number of nodes

Closed

is related to

OTA-1309 Ensure the node in a single-node cluster is handled correctly

Closed

split from

OTA-1245 post-merge testing: OTA-1165 - worker node status

Closed

links to

openshift/oc#1822: OTA-1291: upgrade status: removes custom nodes from the worker pool

openshift/oc#1825: OTA-1291: upgrade status: removes custom nodes from the worker pool (2)

Assignee:: Hongkai Liu

Reporter:: Petr Muller

Contributors:: Evgeni Vakhonin

QA Contact:: Evgeni Vakhonin

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2024/05/17 3:45 PM

Updated:: 2024/09/05 12:17 AM

Resolved:: 2024/07/26 1:16 PM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide