-
Bug
-
Resolution: Unresolved
-
Undefined
-
None
-
4.17.0
-
Important
-
No
-
5
-
ETCD Sprint 259, ETCD Sprint 260, ETCD Sprint 261, ETCD Sprint 262
-
4
-
Rejected
-
False
-
deads reported in this thread that the static pod controller appears to sometimes deploy pods that do not show up in a reasonable timeframe, which occasionally triggers this test to fail (source job):
[sig-node] static pods should start after being created { static pod lifecycle failure - static pod: "etcd" in namespace: "openshift-etcd" for revision: 7 on node: "ci-op-h9zjcc96-51425-8gcc2-master-0" didn't show up, waited: 3m0s}
David suspects that this actually happens far more often than the test failures indicate, however this test should be a good resource to find affected runs.
Test details indicates this fails up to 10% of the time on some job variants. The most common compnent affected appears to be kube-controller-manager, but apiserver and etcd are both appearing at times. Use the test details link if looking for more job runs.
Slack thread has more details from both deads@redhat.com and tjungblu@redhat.com.
Suspicion is that fixing this could improve install times and reliability.
- duplicates
-
OCPBUGS-36604 etcd recovery test has static pod startup failure
- Closed
- is related to
-
OCPBUGS-43631 Static pod controller pods sometimes fail to start [kube-controller-manager]
- Verified
-
OCPBUGS-36604 etcd recovery test has static pod startup failure
- Closed