Goal

There are several reasons causing a node hosting VMs to be not available anymore. Some reasons can be detected by the out-of-band management (or Baseboard Management Controller) of the node and forwarded by the event subscription service. These events should be used to trigger to fencing of the node and to recovery the nodes workload, especially the VMs.

User Stories

As a cluster admin, I like VMs affected by a crashed node to be rescheduled to another node quickly, so that the downtime of the VMs is below 60 seconds.

Non-Requirements

Storage based detection and remediation
Optimize the flow in kubevirt

Notes

~~CNV-71846~~ provides a PoC

duplicates

CNV-68080 DP: Subscription-based Events Detection

In Progress

1.	upstream roadmap issue	New	Unassigned
2.	upstream design	New	Unassigned
3.	upstream documentation	New	Unassigned
4.	upgrade consideration	New	Unassigned
5.	test plans in polarion	New	Unassigned
6.	automated tests	New	Unassigned
7.	downstream documentation merged	New	Unassigned
8.	CNV QE DevOps Requirement/Enablement	New	Unassigned

Assignee:: Dominik Holler

Reporter:: Unassigned

QA Contact:: Geetika Kapoor

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2025/11/14 1:36 PM

Updated:: 2025/11/14 2:15 PM

Resolved:: 2025/11/14 2:13 PM

Details

Description

Goal

User Stories

Non-Requirements

Notes

Attachments

Issue Links

Easy Agile Planning Poker

Sub-Tasks

Activity

People

Dates