OpenShift Bugs / OCPBUGS-14712

Cluster autoscaler is unable to scale down nodes in Azure cloud


    • Type: Bug
    • Resolution: Not a Bug
    • Priority: Normal
    • Affects Version/s: 4.10
    • Component/s: Cluster Autoscaler
    • Impact: Quality / Stability / Reliability

      Description of problem:

      The cluster autoscaler appears unable to scale down nodes.
      
      In the cluster-autoscaler-default pod logs, we can see that the nodes are tainted for deletion, after which the same two messages repeat: "Skipping {node} from delete consideration - the node is currently being deleted" followed by "Nodegroup is nil for azure:///subscriptions/blah/blah/samenodename".
      
      The customer tried draining the nodes in question to help the autoscaler along and to rule out stray pods (for example, pods holding on to storage on those nodes), but it made no difference.
      
      According to the customer, this morning (after several days of issues, and reportedly without any change on their side) the autoscaler worked fine and successfully scaled down nodes.
      
      More logs and info to follow; see also support case 03528995.
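      For anyone triaging a similar report, one way to confirm the state the logs describe is to list the nodes carrying the autoscaler's deletion taint and grep the autoscaler logs for the two repeating messages quoted above. A minimal sketch, assuming the default openshift-machine-api namespace and the cluster-autoscaler-default deployment name from the description (run against the affected cluster):

```shell
# List nodes the autoscaler has tainted for deletion.
# ToBeDeletedByClusterAutoscaler is the taint the cluster autoscaler
# applies to a node before draining and removing it.
oc get nodes -o json \
  | jq -r '.items[]
           | select(.spec.taints[]?.key == "ToBeDeletedByClusterAutoscaler")
           | .metadata.name'

# Follow the autoscaler logs for the messages quoted in the description.
oc -n openshift-machine-api logs deploy/cluster-autoscaler-default \
  | grep -E 'Skipping .* from delete consideration|Nodegroup is nil'
```

      If nodes show up in the first command but never actually get deleted, that matches the "currently being deleted" / "Nodegroup is nil" loop described above and suggests the autoscaler cannot map the Azure provider ID back to a node group.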

      Version-Release number of selected component (if applicable):

       

      How reproducible:

       

      Steps to Reproduce:

      1.
      2.
      3.
      

      Actual results:

       

      Expected results:

       

      Additional info:

       

       

              mimccune@redhat.com Michael McCune
              rhn-support-dasmall Daniel Small
              Zhaohua Sun Zhaohua Sun