- Epic
- Resolution: Done
- Critical
- None
- None
Goal
A node hosting VMs can become unavailable for several reasons. Some of these can be detected by the node's out-of-band management (the Baseboard Management Controller, BMC) and forwarded by the event subscription service. These events should be used to trigger fencing of the node and recovery of the node's workload, especially the VMs.
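The intended flow (event received, node fenced, workload recovered) can be sketched as below. This is a minimal illustration under stated assumptions, not the design from the PoC: the NodeEvent fields, the FENCE_WORTHY event kinds, and the plan_remediation helper are all hypothetical names made up for this example, and the actual event format depends on what the BMC and the event subscription service deliver.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical event kinds that indicate the node is down.
# The real set depends on what the BMC actually reports.
FENCE_WORTHY = {"power_loss", "cpu_fault", "os_crash"}


@dataclass(frozen=True)
class NodeEvent:
    node: str       # name of the affected node
    kind: str       # hypothetical event category
    critical: bool  # whether the BMC flagged the event as critical


def plan_remediation(event: NodeEvent, fenced: set[str]) -> list[str]:
    """Return the ordered remediation steps for one event (illustrative only)."""
    if not event.critical or event.kind not in FENCE_WORTHY:
        return []  # not a node-down condition, nothing to do
    if event.node in fenced:
        return []  # already handled, avoid fencing the same node twice
    fenced.add(event.node)
    # Fence first so the failed node cannot still be running the VMs,
    # then recover the workload on another node.
    return [f"fence {event.node}", f"reschedule VMs from {event.node}"]
```

The ordering is the point of the sketch: fencing comes before rescheduling so that a VM is never started on a second node while the first might still be running it. The fenced set makes repeated events for the same node harmless.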
User Stories
- As a cluster admin, I want VMs affected by a crashed node to be rescheduled to another node quickly, so that the downtime of the VMs stays below 60 seconds.
Non-Requirements
- Storage-based detection and remediation
- Optimizing the flow in KubeVirt
Notes
CNV-71846 provides a PoC
- duplicates: CNV-68080 DP: Subscription-based Events Detection (In Progress)
1. upstream roadmap issue | New | Unassigned
2. upstream design | New | Unassigned
3. upstream documentation | New | Unassigned
4. upgrade consideration | New | Unassigned
5. test plans in polarion | New | Unassigned
6. automated tests | New | Unassigned
7. downstream documentation merged | New | Unassigned
8. CNV QE DevOps Requirement/Enablement | New | Unassigned