Uploaded image for project: 'OpenShift API for Data Protection'
  1. OpenShift API for Data Protection
  2. OADP-1016

DataMover: Restore randomly fails with "secrets vsr-lttsv-secret already exists" error

    XMLWordPrintable

Details

    • False
    • Hide

      None

      Show
      None
    • False
    • oadp-volume-snapshot-mover-container-1.1.1-24
    • ToDo
    • No
    • 0
    • 0
    • Very Likely
    • 0
    • None
    • Unset
    • Unknown

    Description

      Description of problem: Restore started to fail randomly with "secrets vsr-lttsv-secret already exists" error. 

      Version-Release number of selected component (if applicable):
      GCP 4.11
      OADP 1.1.1 (oadp-operator-bundle-container-1.1.1-38)

      How reproducible:
      Intermittently
       
      Steps to Reproduce:

      1. Deploy an application with multiple PVC
      2. Create a backup resource
      3. Delete ns and run restore
      4. Repeat step no.3 until you hit the issue

      Actual results:

      Restore partiallyFailed.

      status:
        completionTimestamp: "2022-11-03T06:58:26Z"
        errors: 1
        phase: PartiallyFailed
        progress:
          itemsRestored: 41
          totalItems: 41
        startTimestamp: "2022-11-03T06:57:22Z"
        warnings: 4

      VSR 

        conditions:
        - lastTransitionTime: "2022-11-03T06:57:44Z"
          message: secrets "vsr-lttsv-secret" already exists
          reason: Error
          status: "False"
          type: Reconciled
        phase: Failed

      VSM logs

      2022-11-03T06:57:44Z DEBUG events Normal {"object":
      {"kind":"Secret","namespace":"openshift-adp","name":"vsr-lttsv-secret","uid":"a8086fa3-96c4-4b49-b4c6-75419da18568","apiVersion":"v1","resourceVersion":"189889"}
      , "reason": "ResticSecretRestoreReconciled", "message": "created restic secret vsr-lttsv-secret"}
      2022-11-03T06:57:44Z INFO Error from batch reconcile: secrets "vsr-lttsv-secret" already exists {"controller": "volumesnapshotrestore", "controllerGroup": "datamover.oadp.openshift.io", "controllerKind": "VolumeSnapshotRestore", "volumeSnapshotRestore":
      {"name":"vsr-lttsv","namespace":"bug-849"}
      , "namespace": "bug-849", "name": "vsr-lttsv", "reconcileID": "44a88509-5b89-463a-a96f-410dcdd0c227", "vsr": "bug-849/vsr-lttsv"}
      

       

      Expected results:
      Restore should pass.

       

      Additional info:

      Attachments

        Activity

          People

            emcmulla@redhat.com Emily McMullan
            rhn-support-prajoshi Prasad Joshi
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: