Loading...

XML

Word

Printable

Type: Bug
Resolution: Duplicate
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.13
Component/s: Management Console
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Critical
Regression:
None

Target Backport Versions:
None
Target Version:

4.13
Release Blocker:
Proposed
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

When a console rollout occurs and one of the managed clusters is not reachable, the pods don't start, and the console deployment gets stuck in creating. Errors like this appear in the log:

E0117 12:02:39.538925 1 auth.go:232] error contacting auth provider (retrying in 10s): Get "https://api.jephilli-latest-01-16-0641.devcluster.openshift.com:6443/.well-known/oauth-authorization-server": dial tcp: lookup api.jephilli-latest-01-16-0641.devcluster.openshift.com on 172.30.0.10:53: no such host

We need to tolerate unresponsive clusters and can't block rollout if one is not responding. This could break cluster upgrades among other issues.

I tried to destroy the unresponsive cluster, but the console operator did not seem to trigger a new rollout. I had to manually delete the previous stuck pods to fix the issue.

Assignee:: Jon Jackson

Reporter:: Samuel Padgett

QA Contact:: YaDan Pei

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2023/01/17 1:40 PM

Updated:: 2025/07/28 11:32 AM

Resolved:: 2023/01/17 2:36 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates