Bug
Resolution: Done-Errata
Normal
4.11
None
MCO Sprint 247, MCO Sprint 248
2
False
Bug Fix
Done
Description of problem:
The MCO takes too long to update the node count of an MCP after the label that the MCP uses to match nodes is removed from a node.
Version-Release number of selected component (if applicable):
How reproducible:
100%
Steps to Reproduce:
1. Remove the `node-role.kubernetes.io/worker=` label from any worker node.
~~~
# oc label node worker-0.sharedocp4upi411ovn.lab.upshift.rdu2.redhat.com node-role.kubernetes.io/worker-
~~~
2. Check the worker MCP for the correct node count.
~~~
# oc get mcp worker
NAME     CONFIG                                             UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
worker   rendered-worker-6916abae250ad092875791f8297c13e1   True      False      False      3              3                   3                     0                      5d7h
~~~
3. Check again after 10-15 minutes.
~~~
# oc get mcp worker
NAME     CONFIG                                             UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
worker   rendered-worker-6916abae250ad092875791f8297c13e1   True      False      False      2              2                   2                     0                      5d7h
~~~
Actual results:
It took 10-15 minutes for the MCP to reflect the node's removal.
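To put a number on the delay instead of re-running `oc get mcp` by hand, a small polling helper can be used. The following is only a sketch: `wait_for_value` and `mcp_machine_count` are hypothetical names that are not part of the original report, and the sketch assumes that the MACHINECOUNT column corresponds to the `.status.machineCount` field of the MachineConfigPool object.

```shell
# Hypothetical helper (not from the original report): run COMMAND repeatedly
# until it prints EXPECTED, then print the number of seconds that elapsed.
# Returns non-zero if TIMEOUT seconds pass without a match.
# Usage: wait_for_value EXPECTED TIMEOUT INTERVAL COMMAND [ARGS...]
wait_for_value() {
  expected=$1; timeout=$2; interval=$3
  shift 3
  start=$(date +%s)
  while :; do
    # A failing command is treated as "no value yet".
    current=$("$@" 2>/dev/null) || current=""
    elapsed=$(( $(date +%s) - start ))
    if [ "$current" = "$expected" ]; then
      echo "$elapsed"
      return 0
    fi
    if [ "$elapsed" -ge "$timeout" ]; then
      return 1
    fi
    sleep "$interval"
  done
}

# Assumed wrapper around the oc CLI: prints the machine count of the given pool.
# Assumption: MACHINECOUNT is backed by .status.machineCount.
mcp_machine_count() {
  oc get mcp "$1" -o jsonpath='{.status.machineCount}'
}
```

Run right after removing the label in step 1, `wait_for_value 2 1200 10 mcp_machine_count worker` would poll every 10 seconds for up to 20 minutes and print how many seconds passed before the worker pool reported 2 machines.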
Expected results:
The MCP should detect the node's removal as soon as the matching label is removed from the node.
Additional info:
- relates to: MCO-452 [tech-preview] Proper state reporting when the MCO changes state (Closed)
- links to: RHEA-2024:0041 OpenShift Container Platform 4.16.z bug fix update