OpenShift Bugs / OCPBUGS-50907

Propagate Reason for Available condition from HostedControlPlane to HostedCluster


    • Type: Bug
    • Resolution: Done-Errata
    • Priority: Undefined
    • Affects Version: 4.19.0
    • Fix Version: 4.19.0
    • Component: HyperShift
    • Quality / Stability / Reliability
    • Release Note Status: Done
    • Release Note Type: Bug Fix
    • Release Note Text:

      * Previously, when the `HostedCluster` resource was not available, the reason was not propagated correctly from the `HostedControlPlane` resource in the condition. The `Status` and `Message` information was propagated for the `Available` condition in the `HostedCluster` custom resource, but the `Reason` value was not. In this release, the reason is also propagated, so you have more information to identify the root cause of unavailability. (link:https://issues.redhat.com/browse/OCPBUGS-50907[OCPBUGS-50907])

      Description of problem:

      The Reason for the Available condition should propagate from HostedControlPlane to HostedCluster together with Status and Message. Currently, only Status and Message are propagated: link. In this case we end up with KASLoadBalancerNotReachable in the HCP and WaitingForAvailable in the HC, even though the detailed message is propagated (see the YAML output below and the propagation sketch that follows it):

      ᐅ oc get hc $CLUSTER_NAME -n $NAMESPACE -oyaml
      
        - lastTransitionTime: "2025-02-13T15:13:55Z"
          message: 'Get "https://ad470e4971ffe4f24bb0085802628868-46f6d7fdaaca476a.elb.us-east-1.amazonaws.com:6443/healthz":
            dial tcp: lookup ad470e4971ffe4f24bb0085802628868-46f6d7fdaaca476a.elb.us-east-1.amazonaws.com
            on 172.31.0.10:53: no such host'
          observedGeneration: 3
          reason: WaitingForAvailable
          status: "False"
          type: Available
      
      ᐅ oc get hcp hc1 -n clusters-hc1 -oyaml
      
        - lastTransitionTime: "2025-02-13T15:14:09Z"
          message: 'Get "https://ad470e4971ffe4f24bb0085802628868-46f6d7fdaaca476a.elb.us-east-1.amazonaws.com:6443/healthz":
            dial tcp: lookup ad470e4971ffe4f24bb0085802628868-46f6d7fdaaca476a.elb.us-east-1.amazonaws.com
            on 172.31.0.10:53: no such host'
          observedGeneration: 1
          reason: KASLoadBalancerNotReachable
          status: "False"
          type: Available
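
      For illustration, a minimal Go sketch of the intended propagation follows. The function and variable names are illustrative, not HyperShift's actual controller code; the point is only that Reason must be copied along with Status and Message when mirroring the condition.

      package main

      import (
          "fmt"

          "k8s.io/apimachinery/pkg/api/meta"
          metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
      )

      // propagateAvailable mirrors an Available condition from the
      // HostedControlPlane onto the HostedCluster's condition list.
      // Illustrative sketch only; not HyperShift's real controller code.
      func propagateAvailable(hcpCond metav1.Condition, hcConditions *[]metav1.Condition) {
          meta.SetStatusCondition(hcConditions, metav1.Condition{
              Type:    "Available",
              Status:  hcpCond.Status,
              Message: hcpCond.Message,
              // The fix: carry the HCP's Reason through instead of
              // replacing it with a generic WaitingForAvailable.
              Reason: hcpCond.Reason,
          })
      }

      func main() {
          var hcConditions []metav1.Condition
          propagateAvailable(metav1.Condition{
              Type:    "Available",
              Status:  metav1.ConditionFalse,
              Reason:  "KASLoadBalancerNotReachable",
              Message: "dial tcp: lookup ...: no such host",
          }, &hcConditions)
          fmt.Println(hcConditions[0].Reason) // KASLoadBalancerNotReachable
      }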
          

      Version-Release number of selected component (if applicable):

      4.19.0

      How reproducible:

      Reproduced as part of https://issues.redhat.com/browse/OCPBUGS-49913, which uses a cluster-wide proxy for the management cluster. In this case, the HCP and HC do not become available and show the errors above.

      Steps to Reproduce:

      Steps are described in this JIRA comment.

      Actual results:

      The Reason for the HCP is KASLoadBalancerNotReachable while the Reason for the HC is WaitingForAvailable, even though the Message is the same in both cases.

      Expected results:

      The Reason KASLoadBalancerNotReachable is propagated to the HostedCluster.
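
      With the fix, the propagated reason can be checked directly on the HostedCluster. The jsonpath filter below is standard oc/kubectl syntax; the expected output assumes the failure mode shown above:

      ᐅ oc get hc $CLUSTER_NAME -n $NAMESPACE -o jsonpath='{.status.conditions[?(@.type=="Available")].reason}'
      KASLoadBalancerNotReachable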

      Additional info:

          

              Assignee: Martin Gencur (mgencur@redhat.com)
              Reporter: Martin Gencur (mgencur@redhat.com)