Uploaded image for project: 'OpenShift Virtualization'
  1. OpenShift Virtualization
  2. CNV-19200

[2098225] [4.11] VM Snapshot Restore hangs indefinitely when backed by a snapshotclass

    XMLWordPrintable

Details

    • Storage Core Sprint 223, Storage Core Sprint 224, Storage Core Sprint 225, Storage Core Sprint 226
    • High

    Description

      +++ This bug was initially created as a clone of Bug #2070366 +++

      Description of problem:

      New "restore" PVC appears to be waiting on a CDI upload server pod to finish, but work is never sent to that pod.

      Version-Release number of selected component (if applicable):
      4.10.0

      How reproducible:
      Always

      Steps to Reproduce:
      1. Take a snapshot of a (in my case Running) VM
      2. Shut down VM
      3. Use UI to Restore to snapshot

      Actual results:
      VM is indefinitely in pending state, no log activity in cdi uploader pod

      Expected results:
      VM is recreated with PVC referencing VolumeSnapshot, and back-end snapshot class handles restore; VM starts quickly.

      Additional info:

      — Additional comment from Michael Henriksen on 2022-03-31 12:18:02 UTC —

      vm/datavolume/pvc yamls pre and post restore would be very helpful

      — Additional comment from Chandler Wilkerson on 2022-03-31 13:51:44 UTC —

      Pre:

      VM: http://pastebin.test.redhat.com/1041370
      DV: http://pastebin.test.redhat.com/1041371
      PVC: http://pastebin.test.redhat.com/1041369

      — Additional comment from Chandler Wilkerson on 2022-03-31 13:57:32 UTC —

      Post restore:

      VM: http://pastebin.test.redhat.com/1041376
      DV: http://pastebin.test.redhat.com/1041373
      PVC: http://pastebin.test.redhat.com/1041374

      — Additional comment from Michael Henriksen on 2022-04-01 00:07:14 UTC —

      This issue should only affect DataVolumes created via network clone operations.

      Is a regression introduced in this PR: https://github.com/kubevirt/containerized-data-importer/pull/1922

      — Additional comment from Michael Henriksen on 2022-04-01 00:53:57 UTC —

      Somewhat related to this PR in progress: https://github.com/kubevirt/containerized-data-importer/pull/2205

      — Additional comment from Bartosz Rybacki on 2022-05-30 09:25:04 UTC —

      Michael, Adam, I think we should update this bug.

      There are two fixes [1] [2] that were merged to CDI 1.49 which landed in OpenShift 4.11. Do we want to backport it to 1.43 so it fixes the bug in 4.10?

      [1] https://github.com/kubevirt/containerized-data-importer/pull/2205
      [2] https://github.com/kubevirt/containerized-data-importer/pull/2227

      Attachments

        Issue Links

          Activity

            People

              alitke@redhat.com Adam Litke
              alitke@redhat.com Adam Litke
              Kevin Alon Goldblatt Kevin Alon Goldblatt
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: