Uploaded image for project: 'OpenShift Virtualization'
  1. OpenShift Virtualization
  2. CNV-45683

Improve resource contention metrics

XMLWordPrintable

    • improve-contention-related-metrics
    • 77
    • Hide
      • Enable schedstats=enable by default (to be confirmed that we want this)
      • Fix kubevirt_vmi_vcpu_wait_seconds_total documentation to be IO specific
      • Add documentation for kubevirt_vmi_vcpu_delay_seconds_total to be CPU scheduling specific
      • Make sure the metric is displayed in a dashboard
      • Explore if Pressure Stall Informations are valueable to detect contended workloads
      • Comparison CPU steal (guest) VS vCPU wait (qemu) VS PSI (cgroup)
      Show
      Enable schedstats=enable by default (to be confirmed that we want this) Fix kubevirt_vmi_vcpu_wait_seconds_total documentation to be IO specific Add documentation for kubevirt_vmi_vcpu_delay_seconds_total to be CPU scheduling specific Make sure the metric is displayed in a dashboard Explore if Pressure Stall Informations are valueable to detect contended workloads Comparison CPU steal (guest) VS vCPU wait (qemu) VS PSI (cgroup)
    • To Do
    • 100% To Do, 0% In Progress, 0% Done
    • doc-ready

      Goal

      Improve vCPU contention related metrics by

      See also: https://github.com/kubevirt/kubevirt/issues/10075

      In addition: A spike to explore how valueable PSI metrics are for VMs, if they complement vCPU ready.

      User Stories

      • As a cluster administrator, I want know when there are vCPU performance issues with my VM, so that I can take action

      Non-Requirements

      • List of things not included in this epic, to alleviate any doubt raised during the grooming process.

      Notes

      • Any additional details or decisions made/needed

          1.
          upstream roadmap issue Sub-task New Normal Unassigned
          2.
          upstream design Sub-task New Normal Unassigned
          3.
          upstream documentation Sub-task New Normal Unassigned
          4.
          upgrade consideration Sub-task New Normal Unassigned
          5.
          CEE/PX summary presentation Sub-task New Normal Unassigned
          6.
          test plans in polarion Sub-task New Normal Unassigned
          7.
          automated tests Sub-task New Normal Unassigned
          8.
          downstream documentation merged Sub-task New Normal Unassigned

              sgott@redhat.com Stuart Gott
              fdeutsch@redhat.com Fabian Deutsch
              Kedar Bidarkar Kedar Bidarkar
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated: