Uploaded image for project: 'OpenShift Virtualization'
  1. OpenShift Virtualization
  2. CNV-51229

Target pod requires a longer time than needed to go in succeeded status

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • None
    • CNV Virt-Node
    • None
    • CNV Virtualization Sprint 262
    • None

      Description of problem:

      Currently, when a migration fails we signal the target pod to cleanup, and then we rely on a domain change event from the informer to unmount all the volumes (containerdisks, etc...).
      Here we need to distinguish 2 flows:
      - A domain is not yet detected on the target (The migration did not started)
      - A migration is already started / domain detected.
      
      In the first case the virt-launcher straight panics. Therefore no event will be send to the handler causing it to wait for some other trigger to reconcile this launcher again.
      In the second case the monitor exits and we wait for notification to happen (unclear if there is case where we don't send it) and then we kill the domain and wait again (this most likely produces event).

      Version-Release number of selected component (if applicable):

       

      How reproducible:

      Failures were observed: https://issues.redhat.com/browse/CNV-34566

      Steps to Reproduce:

      1.
      2.
      3.
      

      Actual results:

       

      Expected results:

       

      Additional info:

       

              ffossemo@redhat.com Federico Fossemo
              ffossemo@redhat.com Federico Fossemo
              Denys Shchedrivyi Denys Shchedrivyi
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated: