Loading...

XML

Word

Printable

Type: Bug
Resolution: Duplicate
Priority: Undefined
Fix Version/s: None
Affects Version/s: 2.6.2
Component/s: None
Labels:
None

Blocked:
False
Blocked Reason:
None
Ready:
False
Intelligence Requested:
Market:

Severity:
Moderate

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

Components:

OCP 4.15.11
OCV 4.15.2
MTV Operator 2.6.2

While migrating a VM from RHV to OpenShift, scratch pvcs are left bound and in use as the importer pods are not cleaned up. Additionally one pvc is left in Lost state, as it is in https://issues.redhat.com/browse/MTV-1238 We note that the importer pod cleanup does not happen.

After migration:

$oc get pods  
NAMESPACE NAME READY STATUS RESTARTS AGE
default importer-prime-9c4ca6df-b651-4ab6-8a11-96a2a7f656e9-checkpoint-4f5bcb18-7f69-4e2c-915c-d112ac0dd203 0/1 Completed 0 1h
default importer-prime-9c4ca6df-b651-4ab6-8a11-96a2a7f656e9-checkpoint-8df7d2c4-4290-4d98-b5b8-a32577b0f55d 0/1 Completed 0 1h
default virt-launcher-ovirt-testvm2-rkzdg 1/1 Running 0 1h

$ oc get pvc -n default
NAME                                                        STATUS   VOLUME                                     CAPACITY       ACCESS MODES   STORAGECLASS      AGE
prime-9c4ca6df-b651-4ab6-8a11-96a2a7f656e9                  Lost     pvc-6cfe9aa7-7c8d-4bc0-8f07-4117cd408838   0                             ocs-storagecluster-ceph-rbd   1h
prime-9c4ca6df-b651-4ab6-8a11-96a2a7f656e9-scratch          Bound    pvc-dbedc23a-a8aa-4247-9952-e20cff859655   284058683598   RWO            ocs-storagecluster-ceph-rbd   1h
testwarm-july2-84e9514e-bd7d-495f-8b72-4a67f0ea02e1-xwsrv   Bound    pvc-6cfe9aa7-7c8d-4bc0-8f07-4117cd408838   284058683598   RWX            ocs-storagecluster-ceph-rbd   1h

This is seen on Netapp trident storage (customer), as well as ocs storage(in our lab). However it is not reproducible using nfs-subdir-external-provisioner(unsupported, but used in testing)

When the migration is done using the nfs provisioner, the importer pods are removed, the prime-XXX-scratch pvc is removed, and everything looks clean. In this case we see something like the following in the forklift-controller logs:

{"level":"info","ts":"2024-07-02 20:50:57.861","logger":"plan|xnhwb","msg":"Deleted importer pod.","plan":{"name":"shaggymigrated","namespace":"openshift-mtv"},"migration":"openshift-mtv/shaggymigrated-tfthd","pod":"openshift-mtv/

When warm migrating to Netapp or ODF storage on OpenShift, we don't see the "Deleted importer pod" messages.

While we know there is a manual work around to delete the importer pods manually, that's a lot of extra work especially when migrating many VMs at once. Additionally it appears to strand pvcs which consume space on the backend storage.

duplicates

MTV-1095 Leftover PVCs in Lost state after cold migrations

Closed

Assignee:: Arik Hadas

Reporter:: Sean Haselden

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/07/03 8:00 PM

Updated:: 2024/07/04 8:36 AM

Resolved:: 2024/07/04 8:36 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates