Description of problem:
From profiling on cert rotation we know that the node informer is called every couple of seconds on node heartbeats. This PR will ensure that all our node listers only ever listen/inform on the master node updates to reduce the frequency of unnecessary sync calls. Also related to the issue, increasing the amount of node status updates: OCPBUGS-29713 OCPBUGS-29424
Version-Release number of selected component (if applicable):
4.16 down to 4.12, we need to check all versions
How reproducible:
always
Steps to Reproduce:
1. create a cluster 2. look at some metric 3. observe improvement
Actual results:
increased amount of CPU usage for CEO
Expected results:
less amount of CPU consumed for CEO
Additional info:
already fixed in 4.16 with https://github.com/openshift/cluster-etcd-operator/pull/1205 creating this ticket for backporting
- blocks
-
OCPBUGS-29717 [4.15] Replace nodelister with master nodelister everywhere
- Closed
- is caused by
-
OCPBUGS-29713 Excessive node status updates causing high control plane CPU
- Closed
- is cloned by
-
OCPBUGS-29717 [4.15] Replace nodelister with master nodelister everywhere
- Closed
- links to
-
RHEA-2024:0041 OpenShift Container Platform 4.16.z bug fix update