Type: Epic
Resolution: Done
Priority: Major
Fix Version/s: CNV v4.16.0
Affects Version/s: None
Component/s: CNV Install, Upgrade and Operators
Labels:
- cnv-observability
- no-doc
- no-qe
- no-ux

Epic Name:
cnv-actionable-telemetry-4.16
Activity Type:
Product / Portfolio Work
Acceptance Criteria:

Create bugs for suspicious alerts on internal clusters, based on observations of RH internal environments (cnv.engineering2).

Unclutter alerting on internal clusters so that they are clean and show only relevant alerts that require action.

Add information about non-reporting accounts to the weekly report.
Current Status:
Green
Epic Status:
To Do
Feature Link:
VIRTSTRAT-264 - SD: Fleet Alerting Dashboards
Parent Link:
VIRTSTRAT-264 - SD: Fleet Alerting Dashboards
Hierarchy Progress Bar:

0% To Do, 0% In Progress, 100% Done
Ready-Ready:

dev-ready, doc-ready, po-ready, qe-ready, ux-ready
Status Summary:

in progress....


SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Goal

Use existing telemetry data to trigger practical actions. The aim is to process the data we already have rather than to add new metrics or alerts. Examples are included in the user stories below.

Based on ~~CNV-31126~~, we created a doc, https://docs.google.com/spreadsheets/d/1xYCyn-Y35ZA3ABZNYowt9ZonCIoUtNps0P7GhrLMTFY/edit?usp=sharing, that lists the logs we need to check. This epic includes the spikes to go over each of the issues found in those logs.

User Stories

As an OpenShift Virtualization team member, I'd like to see trends in the telemetry to get a feel for what's going on in the field. For example, alerts could be grouped by z-stream and y-stream so that, with each released version, we can see trends in the telemetry data, compare them with previous versions, and take action.
As an OpenShift Virtualization team member, I'd like to be notified about any suspicious trends in alerts (through a dedicated section in the weekly telemeter report).
As an OpenShift Virtualization engineer/manager, I'd like a bug to be opened for suspicious-looking alerts so that my team can investigate them.
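
The grouping by y-stream and z-stream mentioned in the first user story could look roughly like the sketch below. It is only an illustration: the record layout (version, alert name, firing count), the alert names, and the helper names `stream_keys` and `group_alerts` are all hypothetical and are not taken from the telemeter report or any existing tooling. It also assumes versions are reported as `X.Y.Z` strings.

```python
from collections import Counter


def stream_keys(version):
    """Split an 'X.Y.Z' version into its y-stream ('X.Y') and z-stream ('X.Y.Z')."""
    major, minor, patch = version.split(".")[:3]
    return f"{major}.{minor}", f"{major}.{minor}.{patch}"


def group_alerts(records):
    """Sum firing counts per (stream, alert name), once per y-stream and once per z-stream."""
    by_y, by_z = Counter(), Counter()
    for version, alert, count in records:
        y_stream, z_stream = stream_keys(version)
        by_y[(y_stream, alert)] += count
        by_z[(z_stream, alert)] += count
    return by_y, by_z


# Hypothetical sample data: (CNV version, alert name, number of clusters firing it).
records = [
    ("4.15.2", "ExampleAlertA", 3),
    ("4.15.3", "ExampleAlertA", 1),
    ("4.16.0", "ExampleAlertA", 7),
    ("4.16.0", "ExampleAlertB", 2),
]

by_y, by_z = group_alerts(records)

# Print per-y-stream totals so consecutive releases can be compared side by side.
for (stream, alert), total in sorted(by_y.items()):
    print(stream, alert, total)
```

Comparing the per-stream totals of the same alert across releases is one simple way to spot a trend worth a dedicated line in the weekly report; what counts as "suspicious" is left open here, as it is in the stories above.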

Non-Requirements

List of things not included in this epic, to alleviate any doubt raised during the grooming process.

Notes

Any additional details or decisions made/needed

Done Checklist

Who	What	Reference
DEV	Upstream roadmap issue (or individual upstream PRs)	<link to GitHub Issue>
DEV	Upstream documentation merged	<link to meaningful PR>
DEV	gap doc updated	<name sheet and cell>
DEV	Upgrade consideration	<link to upgrade-related test or design doc>
DEV	CEE/PX summary presentation	label epic with cee-training and add a <link to your support-facing preso>
QE	Test plans in Polarion	<link or reference to Polarion>
QE	Automated tests merged	<link or reference to automated tests>
DOC	Downstream documentation merged	<link to meaningful PR>

clones

CNV-31123 Actionable Telemetry for internal clusters - 4.15

Closed

is related to

CNV-35858 virt-launcher Logs improvements

Refinement

CNV-35860 CDI Logs improvements - 4.16

Closed

CNV-35861 SSP Logs improvements - 4.17

Closed

CNV-35862 Network Logs improvements

Closed

Assignee: Shirly Radco

Reporter: Krzysztof Majcher

QA Contact: Debarati Basu-Nag

Votes: 0

Watchers: 3

Created: 2023/11/21 2:20 PM

Updated: 2025/08/04 8:48 PM

Resolved: 2024/06/10 9:33 AM
