-
Bug
-
Resolution: Done
-
Critical
-
4.9.z
-
None
Description of problem:
On a node configured with 500 pods per node, Pods fail to property start or return other failures
egrep 'CreateContainerError|ImageInspectError' pod_list.txt | wc -l
334
Version-Release number of selected component (if applicable):
OpenShift 4.9.45
Cri-o: cri-o://1.22.5-7.rhaos4.9.git3dbcd3c.el8
How reproducible: Frequently
Steps to Reproduce:
1. schedule more than 250 pods on a node as a single operation, for instance after rebooting a node on busy cluster
Actual results: Pods with containers errors
Expected results: Pod running
Additional info:
Looking at the crio stack trace of the issue, this looks that the fix at https://bugzilla.redhat.com/show_bug.cgi?id=2082344 did not catch completely the problem.