Uploaded image for project: 'OpenStack as Infra'
  1. OpenStack as Infra
  2. OSASINFRA-4090

Investigate failing cinder-csi-operator e2e-openstack jobs when using Vexxhost cloud

XMLWordPrintable

    • Icon: Spike Spike
    • Resolution: Unresolved
    • Icon: Normal Normal
    • openshift-4.22
    • None
    • None
    • ShiftStack Sprint 284

      In the initial attempt to migrate cinder jobs to run on Vexxhost the following jobs failed:

      • ci/rehearse/openshift/csi-operator/main/e2e-openstack-cinder-csi
      • ci/rehearse/openshift/csi-operator/main/e2e-openstack

      The e2e-openstack-cinder-csi job, tests successfully validate volume creation, snapshotting, and restoration, but hang indefinitely when attempting to delete the source PersistentVolume (PV).

      Looking at the logs the test is hanging on this step where eventually it timeouts

      I1014 12:53:44.631129 1956 pv.go:205] Deleting PersistentVolumeClaim "cinder.csi.openstack.org5vccv"
      I1014 12:53:44.654166 1956 pv.go:863] Waiting up to 20m0s for PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 to get deleted
      I1014 12:53:44.671860 1956 pv.go:867] PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 found and phase=Bound (17.628576ms)
      I1014 12:53:49.693220 1956 pv.go:867] PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 found and phase=Released (5.038994257s)
      I1014 12:53:54.713838 1956 pv.go:867] PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 found and phase=Released (10.059610359s)
      I1014 12:53:59.735720 1956 pv.go:867] PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 found and phase=Released (15.081492182s)
      I1014 12:54:04.758832 1956 pv.go:867] PersistentVolume pvc-e03f12c8-ac22-4d18-bbdb-7400cc701aa6 found and phase=Released (20.104605448s)

      This needs to be investigated further to understand the reason the test hangs at the delete resource.

      To see if its to-do with the Vexxhost cloud environment having issues with Cinder back-end processing volume deletions or the need to change other configuration on this cloud as the snapshot service does not seem to be fully working.

              rh-ee-dlawton Daniel Lawton
              rh-ee-dlawton Daniel Lawton
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: