Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Critical
Fix Version/s: None
Affects Version/s: 4.18
Component/s: HyperShift
Labels:
- triaged

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Critical
Regression:
None

Target Backport Versions:
None
Target Version:

4.18.0
Release Blocker:
Rejected
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

HO bump to v0.1.48 fails due to new HO crashing 

panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x0 pc=0x37a4d7e]
goroutine 69 [running]:
github.com/openshift/hypershift/hypershift-operator/controllers/hostedcluster/metrics.(*hostedClustersMetricsCollector).Collect(0xc000e3cbc0, 0xc0007082a0)
    /hypershift/hypershift-operator/controllers/hostedcluster/metrics/metrics.go:476 +0x2cfe
github.com/prometheus/client_golang/prometheus.(*Registry).Gather.func1()
    /hypershift/vendor/github.com/prometheus/client_golang/prometheus/registry.go:455 +0x189
created by github.com/prometheus/client_golang/prometheus.(*Registry).Gather in goroutine 181
    /hypershift/vendor/github.com/prometheus/client_golang/prometheus/registry.go:547 +0xddd

Version-Release number of selected component (if applicable):

    4.18

How reproducible:

100%

Steps to Reproduce:

    1. Bump HO from v0.1.47 to v0.1.48
    2.
    3.

Actual results:

    fails to use new HO version

Expected results:

    Sucessfully uses new HO version

Additional info:

    this is caused because the etcdRecoveryActive status is only present when the job is trigerred and we dont nil check it during metric collection

links to

openshift/hypershift#4846: OCPBUGS-42737: add nil check to etcdRecoveryActiveCondition when coll…

RHEA-2024:6122 OpenShift Container Platform 4.18.z bug fix update

Assignee:: Patryk Stefanski

Reporter:: Patryk Stefanski

Need Info From:: None

Contributors:: None

QA Contact:: He Liu

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: 2024/10/03 1:30 PM

Updated:: 2025/07/20 1:23 PM

Resolved:: 2025/02/25 4:43 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide