Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 4.19.0
Affects Version/s: 4.18
Component/s: Installer / Single Node OpenShift
Labels:
- edge-payload
- triaged

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
0
Severity:
Moderate
Regression:
None

Target Backport Versions:
None
Target Version:

4.19.0
Release Blocker:
Proposed
Sprint:
OCPEDGE Sprint 264
sprint_count:
1

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
In Progress
Release Note Type:
Release Note Not Required
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Context Thread

As a maintainer of the SNO CI lane, I would like to ensure that the following test doesn't failure regularly as part of SNO CI.

[sig-architecture] platform pods in ns/openshift-e2e-loki should not exit an excessive amount of times

This issue is a symptom of a greater problem with SNO where there is downtime in resolving DNS after the upgrade reboot where the DNS operator has an outage while its deploying the new DNS pods. During that time, loki exists after hitting the following error:

2024/10/23 07:21:32 OIDC provider initialization failed: Get "https://sso.redhat.com/auth/realms/redhat-external/.well-known/openid-configuration": dial tcp: lookup sso.redhat.com on 172.30.0.10:53: read udp 10.128.0.4:53104->172.30.0.10:53: read: connection refused

This issue is important because it can contribute to payload rejection in our blocking CI jobs.

Acceptance Criteria:

Problem is discussed with the networking team to understand the best path to resolution and decision is documented
Either the DNS operator or test are adjusted to address or mitigate the issue.
CI is free from the issue in test results for an extended period. (Need to confirm how often we're seeing it first before this period can be defined with confidence).

blocks

OCPBUGS-46019 [release-4.18] Loki on SNO throws excessive restarts while waiting for DNS deployment

Closed

is cloned by

OCPBUGS-46019 [release-4.18] Loki on SNO throws excessive restarts while waiting for DNS deployment

Closed

is related to

OCPBUGS-42777 SNO Regression for [sig-network-edge] Verify DNS availability during and after upgrade success

Closed

relates to

OCPBUGS-42777 SNO Regression for [sig-network-edge] Verify DNS availability during and after upgrade success

Closed

links to

openshift/origin#29329: OCPBUGS-44970: Excluding loki prod-bearer-token container from excessive restarts test in SNO

Assignee:: Jeremy Poulin

Reporter:: Jeremy Poulin

Need Info From:: None

Contributors:: None

QA Contact:: Neil Hamza

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/11/25 4:46 PM

Updated:: 2025/07/18 1:33 PM

Resolved:: 2024/12/19 10:25 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates