  1. OpenShift Virtualization
  2. CNV-35196

Non-blocking / Best effort live migration during eviction


    • best-effort-eviction
      • MUST evaluate whether PDBs are still the right way to block VMs, as PDBs ultimately block the upgrade
      • MUST provide a best-effort strategy for eventually forcing upgrades to roll through
      • MUST address plain evictions
      • MUST address evictions as part of upgrade flows
    • To Do
    • CNV-8280 - VM balancing
    • 73% To Do, 9% In Progress, 18% Done
    • doc-ready, ux-ready

      Goal

      virt-launcher pods are evicted during CNV and OCP updates. When live migration is used during eviction, a live migration that fails continuously can block the eviction indefinitely.

       

      The goal is to provide a cluster-scoped tunable that treats live migration during updates (both OCP and CNV) as best effort. Best effort means that live migration should be attempted multiple times, but should eventually give up and fall back to a restart of the VMI.
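The best-effort behaviour described in the goal can be sketched as a bounded retry loop with a fallback. This is only an illustration of the intended control flow, not the KubeVirt implementation: the names `evict_best_effort`, `try_live_migrate`, `restart`, and `max_attempts` are hypothetical, and the default of 3 attempts is an assumption, since the epic does not say how many attempts are made or how the tunable is exposed.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvictionOutcome:
    action: str    # "migrated" or "restarted"
    attempts: int  # number of live migration attempts made


def evict_best_effort(
    try_live_migrate: Callable[[], bool],  # hypothetical hook: True if the migration succeeded
    restart: Callable[[], None],           # hypothetical hook: restart the VMI instead
    max_attempts: int = 3,                 # assumed value; stands in for the cluster-scoped tunable
) -> EvictionOutcome:
    """Attempt live migration a bounded number of times, then fall back to a restart."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    for attempt in range(1, max_attempts + 1):
        if try_live_migrate():
            # The workload moved without downtime; the eviction can proceed.
            return EvictionOutcome("migrated", attempt)

    # Every attempt failed: give up on live migration so the update is not
    # blocked indefinitely, and restart the workload instead.
    restart()
    return EvictionOutcome("restarted", max_attempts)
```

For example, calling `evict_best_effort(lambda: False, lambda: None)` makes three migration attempts and then returns an outcome whose `action` is `"restarted"`. The point of the bound is that the eviction always terminates, whichever path is taken.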

      User Stories

      • As a CNV cluster admin, during a CNV update I want to ensure all VMI workloads are eventually updated with new virt-launchers without intervention in the event that live migration of a VMI continually fails.
      • As an OCP cluster admin, I don't want it to be possible for KubeVirt virtual machines to block OCP node updates indefinitely due to failed live migration.

      Non-Requirements

      Notes

      • Any additional details or decisions made/needed

      Done Checklist

      Who | What | Reference
      DEV | Upstream roadmap issue (or individual upstream PRs) | <link to GitHub Issue>
      DEV | Upstream documentation merged | <link to meaningful PR>
      DEV | gap doc updated | <name sheet and cell>
      DEV | Upgrade consideration | <link to upgrade-related test or design doc>
      DEV | CEE/PX summary presentation | label epic with cee-training and add a <link to your support-facing preso>
      QE | Test plans in Polarion | <link or reference to Polarion>
      QE | Automated tests merged | <link or reference to automated tests>
      DOC | Downstream documentation merged | <link to meaningful PR>

            sgott@redhat.com Stuart Gott
            rhn-engineering-dvossel David Vossel
            Debarati Basu-Nag Debarati Basu-Nag