Uploaded image for project: 'OpenShift Virtualization'
  1. OpenShift Virtualization
  2. CNV-45086

Live Migration stuck due to failure to unplug sriov device

XMLWordPrintable

    • Incidents & Support
    • 0.42
    • False
    • Hide

      None

      Show
      None
    • False
    • None
    • Moderate
    • None

      Description of problem:

      When trying to drain a node, a Live Migration of a VM gets stuck in a loop and fails due to: 
      
      reason:Live migration failed error encountered during preparing domain for migration: failed to detach hostdev <hostdev type="pci" managed="no" mode="subsystem"><source><address type="" domain="0x0000" bus="0xd8" slot="0x0a" function="0x7"></address></source><address type="pci" domain="0x0000" bus="0x20" slot="0x00" function="0x0"></address><alias name="ua-sriov-nic-2"></alias></hostdev>, err: virError(Code=1, Domain=10, Message='internal error: unable to execute QEMU command 'device_del': Device ua-sriov-nic-2 is already in the process of unplug')
      
      VM has 11 sriov devices: 
      
             interfaces:
                - bridge: {}
                  macAddress: '52:54:00:02:01:00'
                  model: virtio
                  name: nic-1
                - macAddress: '52:54:00:02:01:01'
                  model: virtio
                  name: nic-2
                  pciAddress: '0000:20:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:02'
                  model: virtio
                  name: nic-3
                  pciAddress: '0000:21:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:03'
                  model: virtio
                  name: nic-4
                  pciAddress: '0000:22:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:04'
                  model: virtio
                  name: nic-5
                  pciAddress: '0000:23:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:05'
                  model: virtio
                  name: nic-6
                  pciAddress: '0000:24:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:06'
                  model: virtio
                  name: nic-7
                  pciAddress: '0000:25:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:07'
                  model: virtio
                  name: nic-8
                  pciAddress: '0000:26:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:08'
                  model: virtio
                  name: nic-9
                  pciAddress: '0000:27:00.0'
                  sriov: {}
                - macAddress: '52:54:00:02:01:09'
                  model: virtio
                  name: nic-10
                  pciAddress: '0000:28:00.0'
                  sriov: {}
                - macAddress: 52:54:00:02:01:0a
                  model: virtio
                  name: nic-11
                  pciAddress: '0000:29:00.0'
                  sriov: {}

       

      Version-Release number of selected component (if applicable):

      OCP 4.12.30
      OCV 4.12.7 

       

      How reproducible:

      Happened on one VM out of many.  Unclear if it's reproducible. 

      Steps to Reproduce:

      1.
      2.
      3.
      

      Actual results:

       

      Expected results:

      Successful Migration 

      Additional info:

       

              omisan@redhat.com Orel Misan
              shaselde@redhat.com Sean Haselden
              Yoss Segev Yoss Segev
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

                Created:
                Updated:
                Resolved: