-
Epic
-
Resolution: Unresolved
-
Critical
-
None
-
None
-
improve-contention-related-metrics
-
77
-
-
In Progress
-
67% To Do, 33% In Progress, 0% Done
-
doc-ready
Goal
Improve vCPU contention related metrics by
- make them work out of the box
- add documentation for existing metric kubevirt_vmi_vcpu_wait_seconds_total to https://docs.redhat.com/en/documentation/openshift_container_platform/4.16/html/virtualization/monitoring#virt-promql-vcpu-metrics_virt-prometheus-queries
- kubevirt_vmi_vcpu_wait_seconds_total is documented, but the docs lack, that the metric is IO specific (_delay metric is CPU scheduling specific)
See also: https://github.com/kubevirt/kubevirt/issues/10075
In addition: A spike to explore how valueable PSI metrics are for VMs, if they complement vCPU ready.
User Stories
- As a cluster administrator, I want know when there are vCPU performance issues with my VM, so that I can take action
Non-Requirements
- List of things not included in this epic, to alleviate any doubt raised during the grooming process.
Notes
- Any additional details or decisions made/needed
- is documented by
-
CNV-45694 Add documentation for kubevirt_vmi_vcpu_delay_seconds_total
-
- New
-
- is related to
-
CNV-46221 Add to the Red Hat docs a link to the CNV metrics documentation
-
- Closed
-
- relates to
-
VIRTSTRAT-65 Load Aware balancing
-
- In Progress
-
1.
|
upstream roadmap issue |
|
New | |
Unassigned |
2.
|
upstream design |
|
New | |
Unassigned |
3.
|
upstream documentation |
|
New | |
Unassigned |
4.
|
upgrade consideration |
|
New | |
Unassigned |
5.
|
CEE/PX summary presentation |
|
New | |
Unassigned |
6.
|
test plans in polarion |
|
New | |
Unassigned |
7.
|
automated tests |
|
New | |
Unassigned |
8.
|
downstream documentation merged |
|
New | |
Unassigned |