Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.17, 4.18
Component/s: Etcd
Labels:
- edge-payload

Severity:
Important
Regression:
No
Story Points:
5
Sprint:
ETCD Sprint 259, ETCD Sprint 260, ETCD Sprint 261, ETCD Sprint 262, ETCD Sprint 263
sprint_count:
5
Release Blocker:
Rejected
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Target Version:

4.18

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

deads reported in this thread that the static pod controller appears to sometimes deploy pods that do not show up in a reasonable timeframe, which occasionally triggers this test to fail (source job):

[sig-node] static pods should start after being created 

{  static pod lifecycle failure - static pod: "etcd" in namespace: "openshift-etcd" for revision: 7 on node: "ci-op-h9zjcc96-51425-8gcc2-master-0" didn't show up, waited: 3m0s}

David suspects that this actually happens far more often than the test failures indicate, however this test should be a good resource to find affected runs.

Test details indicates this fails up to 10% of the time on some job variants. The most common compnent affected appears to be kube-controller-manager, but apiserver and etcd are both appearing at times. Use the test details link if looking for more job runs.

Slack thread has more details from both deads@redhat.com and tjungblu@redhat.com.

Suspicion is that fixing this could improve install times and reliability.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

HA 4.17.png
291 kB
2025/01/30 1:24 AM
HA 4.18.png
278 kB
2025/01/30 1:24 AM
image-2024-08-02-12-31-04-963.png
86 kB
2024/08/02 10:31 AM
image-2024-08-02-12-32-01-567.png
78 kB
2024/08/02 10:32 AM
image-2024-08-02-12-38-20-669.png
80 kB
2024/08/02 10:38 AM
image-2024-08-02-12-39-20-189.png
81 kB
2024/08/02 10:39 AM
image-2024-08-02-12-46-28-665.png
125 kB
2024/08/02 10:46 AM
image-2024-08-02-12-46-51-229.png
82 kB
2024/08/02 10:46 AM
SNO 4.17.png
247 kB
2025/01/30 1:24 AM
SNO 4.18.png
298 kB
2025/01/30 1:24 AM

duplicates

OCPBUGS-36604 etcd recovery test has static pod startup failure

Closed

is related to

OCPBUGS-36604 etcd recovery test has static pod startup failure

Closed

OCPBUGS-43631 Static pod controller pods sometimes fail to start [kube-controller-manager]

Closed

Assignee:: Haseeb Tariq

Reporter:: Devan Goodwin

QA Contact:: Ge Liu

Votes:: 0 Vote for this issue

Watchers:: 15 Start watching this issue

Created:: 2024/07/11 12:31 PM

Updated:: 2025/01/31 7:37 PM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates