Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-9780

[MDR]Cluster namespace is in Terminating state after hub recovery

XMLWordPrintable

    • Moderate
    • No

      Description of problem:

       

      After performing hub recovery I am not able to import cluster as cluster namespace in Terminating state.

      hub-passive oc get project | grep asagare
      asagare-c1                                                        Terminating
      asagare-c2                                                        Terminating

       

      Hub restore process is not completed successfully.  
      ➜ hub-passive oc get restore -A
      NAMESPACE NAME PHASE MESSAGE
      open-cluster-management-backup restore-acm FinishedWithErrors Velero restores have run to completion but encountered 1+ errors.
       
      here is the restore resource
      ➜ hub-passive oc get restore -A -o yaml
      apiVersion: v1
      items:

      • apiVersion: cluster.open-cluster-management.io/v1beta1
        kind: Restore
        metadata:
        creationTimestamp: "2024-02-01T15:13:47Z"
        generation: 1
        name: restore-acm
        namespace: open-cluster-management-backup
        resourceVersion: "498190"
        uid: af8dfc42-fdcd-48d5-9426-360b9c4d4c81
        spec:
        cleanupBeforeRestore: CleanupRestored
        veleroCredentialsBackupName: latest
        veleroManagedClustersBackupName: latest
        veleroResourcesBackupName: latest
        status:
        lastMessage: Velero restores have run to completion but encountered 1+ errors
        phase: FinishedWithErrors
        veleroCredentialsRestoreName: restore-acm-acm-credentials-schedule-20240201150043
        veleroGenericResourcesRestoreName: restore-acm-acm-resources-generic-schedule-20240201150043
        veleroManagedClustersRestoreName: restore-acm-acm-managed-clusters-schedule-20240201150043
        veleroResourcesRestoreName: restore-acm-acm-resources-schedule-20240201150043
        kind: List
        metadata:
        resourceVersion: ""

        Version-Release number of selected component (if applicable):

      OCP-4.15

      ODF-4.15

      AcM- 2.9.2

      How reproducible:

      Steps to Reproduce:

        1. Create MDR env with active hub , passive hub, 2 managed cluster
        2. Create workloads on both managed cluster
        3. Performed neutral site(powered off only active hub) based hub recovery  mentioned in below doc 
        4. verified restore resource.
        5. Import cluster using secrets.

      https://docs.google.com/document/d/1DOlkuKpbZJyzWnhll1-pj0jL3dJSzPFY8rvWieMqZU8/edit#heading=h.8hwvhfin7z29 

       

      Attached secret file .

       

      Actual results:

      restore resource failed with error "
      Velero restores have run to completion but encountered 1+ errors
      "

      Expected results:

      Restore resource status should Finished.

      Additional info:

              jiazhu@redhat.com Jian Zhu
              rh-ee-asagare Avdhoot Sagare
              Hui Chen Hui Chen
              Votes:
              0 Vote for this issue
              Watchers:
              15 Start watching this issue

                Created:
                Updated:
                Resolved: