Loading...

XML

Word

Printable

Type: Feature
Resolution: Obsolete
Priority: Major
Fix Version/s: None
Affects Version/s: None
Component/s: Hosted Control Planes
Labels:
- FAC:Red
- ROSA

Activity Type:
Product / Portfolio Work
Parent Link:
None
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Size:
None

Target Version:
None
Release Blocker:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
None
PX Priority Data:
None
PX Impact Score:
PX Technical Impact:
None
PX Impact Range:
None
PX Scheduling Request:
None
PX Technical Impact Notes:
None

Intelligence Requested:
Market:

Feature Overview (aka. Goal Summary)

The SLO dashboard that we used to have in grafana stopped working due to January's sunsetting of the RHOBS that was being used in ROSA.

While we do no longer have the dashboard, the need for it has not changed nor has it been served completely by other means. The current situation is that we have some degree of coverage over our SLOs from the fact that important breaches in them would snowball and affect the overall ROSA/HCP SLOs that SREs track. However, the agility in that is not a situation that should persist in time.

Goals (aka. expected user outcomes)

Increased HyperShift Operator reliability in ROSA by closer tracking of trends.

Requirements (aka. Acceptance Criteria):

There is a Dashboard in Dynatrace specific to the operation of HyperShift
The dashboard needs to cover integration, stage, production canary and the different production sectors.
The hypershift release duty documentation is expanded to point to it and explain its maintenance and usage.
Weekly reporting of the team SLOs to SRE changes to it.
If possible it should be definied either in the hypershift github repo or in the hypershift team gitlab repo and synced to dynatrace.

Deployment considerations	List applicable specific needs (N/A = not applicable)
Self-managed, managed, or both	managed
Classic (standalone cluster)	no
Hosted control planes	yes
Multi node, Compact (three node), or Single node (SNO), or all	all supported ROSA configurations
Connected / Restricted Network	all supported ROSA configurations
Architectures, e.g. x86_x64, ARM (aarch64), IBM Power (ppc64le), and IBM Z (s390x)	all supported ROSA configurations
Operator compatibility	all supported ROSA configurations
Backport needed (list applicable versions)	no
UI need (e.g. OpenShift Console, dynamic plugin, OCM)	Only in Dynatrace
Other (please specify)

Use Cases (Optional):

Alerting of HyperShift malfunction
Tracking trends to the performance of HyperShift
Collecting metrics around known incidents and minor deterioration to inform decisions.

Out of Scope

New metrics and or panels over what we used to have.

Documentation Considerations

Only needs documenting in the team ops repository.

Interoperability Considerations

It would be useful to consider if it can be done in a way that is similar to what we'll need for ARO/HCP.

Assignee:: Antoni Segura Puimedon

Reporter:: Antoni Segura Puimedon

Need Info From:: None

Contributors:: None

Architect:: None

Developer:: Salvatore Dario Minonne

QA Contact:: None

Doc Contact:: None

Product Operations Engineering Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2025/04/30 8:21 AM

Updated:: 2025/10/09 1:42 AM

Resolved:: 2025/10/08 1:03 PM

Target end:: 2025/12/01

Details

Description

Feature Overview (aka. Goal Summary)

Goals (aka. expected user outcomes)

Requirements (aka. Acceptance Criteria):

Use Cases (Optional):

Out of Scope

Documentation Considerations

Interoperability Considerations

Attachments

Easy Agile Planning Poker

Activity

People

Dates