Loading...

XML

Word

Printable

Type: Feature Request
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Component/s: Hosted Control Planes
Labels:

Target Version:
None
Activity Type:
Product / Portfolio Work
Status Summary:
None
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Products:
None
Hierarchy Progress Bar:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
None
PX Impact Score:
PX Impact Range:
None
PX Priority Data:
None
PX Technical Impact:
None
PX Technical Impact Notes:
None
PX Scheduling Request:
None

1. Proposed title of this feature request

Node drain reporting improvements

2. What is the nature and description of the request?

This is a request to improve observability around worker node drains and their status.

What we're looking for:

a metric that reports node drain progress
- in case of failures, the pod and reason: e.g. the pods blocking it and why (e.g. pod abc because of pdb)
status on NodePools that reflects node drain failures, the offending pods and the reason
[optional] status on CAPI machine reflecting node drain failures, the offending pods and the reason

3. Why does the customer need this? (List the business requirements here)

Managed services (ROSA/ARO HCP) don't expose node drain failures to the customer. The node drain failures are currently only available in CAPI logs. Managed services SRE is looking for a metric to automate sending notifications to customers when a node drain fails due to their workload, as well as a status on the nodepools as a general UX improvement.

4. List any affected packages or components.

hypershift-operator, CAPI

relates to

OCPSTRAT-1615 [Observability] Enhanced Debuggability for HyperShift Cluster NodePool Failures

Assignee:: Ramon Acedo

Reporter:: Claudio Busse

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2025/02/26 9:35 AM

Updated:: 2025/07/03 1:22 PM

Target start:: None

Target end:: None

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates