Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Blocker
Fix Version/s: OADP 1.1.2
Affects Version/s: OADP 1.1.2
Component/s: restic, velero
Labels:

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Fixed in Build:
oadp-velero-container-1.1.2-12
QEStatus:
ToDo
Intelligence Requested:
Market:

Cost of Delay:
0
WSJF:
0
Risk Probability:
Very Likely
Risk Score:
0

Workstream:

None

Root Cause:
Unset
Failure Category:
Unknown

Release Blocker:
Approved

Regression:
Yes

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

Description of problem:

Velero backup stays in progress status after restic pod is restarted due to OOM killed, before this build oadp-operator-bundle-container-1.1.2-14 test passed as usual but now it started failing. Attached report portal link below.

https://reportportal-migration-qe.apps.ocp-c1.prod.psi.redhat.com/ui/#oadp/launches/all/2689/95281/log

Upstream PR: https://github.com/vmware-tanzu/velero/pull/4893

Version-Release number of selected component (if applicable):

OADP 1.1.2

Build :- oadp-operator-bundle-container-1.1.2-16

How reproducible:

Always
Failing consistently.

Steps to Reproduce:

Polarion case :- https://polarion.engineering.redhat.com/polarion/redirect/project/OADP/workitem?id=OADP-231

1. Create a dpa CR with low restic limit resource

apiVersion: oadp.openshift.io/v1alpha1
kind: DataProtectionApplication
metadata:
  name: ts-dpa
  namespace: openshift-adp
spec:
  backupLocations:
  - velero:
      credential:
        key: cloud
        name: cloud-credentials-gcp
      default: true
      objectStorage:
        bucket: oadpbucket163761
        prefix: velero-e2e-50e5ea53-7a22-11ed-b0bf-845cf3eff33a
      provider: gcp
  configuration:
    restic:
      enable: true
      podConfig:
        resourceAllocations:
          limits:
            cpu: 100m
            memory: 50Mi
          requests:
            cpu: 50m
            memory: 10Mi
    velero:
      defaultPlugins:
      - openshift
      - gcp
      - kubevirt

2. Create a restic backup

Actual results:

Backup got stuck in inprogress status.

$ oc get podvolumebackup
NAME                                                 STATUS       CREATED   NAMESPACE       POD                  VOLUME            REPOSITORY ID                                                                               UPLOADER TYPE   STORAGE LOCATION   AGE
backup1-53b48381-7a22-11ed-b0bf-845cf3eff33a-bndxk   InProgress   11m       test-oadp-591   postgresql-1-hf7js   postgresql-data   gs:oadpbucket163761:/velero-e2e-ebeca73d-79f2-11ed-941e-0a58ac1e09e0/restic/test-oadp-591   restic          ts-dpa-1           11m

Expected results:
PodVolumeBackup should be marked as Failed in case of restic pod restart. Also backup should be marked as partiallyFailed.

Additional info:

is related to

OADP-1078 Backup gets stuck InProgress status after restic pod gets restarted

Closed

links to

openshift/velero#238: oadp-1.1: OADP-1256 Use updated PVB/PVR for patching Failed Phase during startup

velero release-1.9 PR

1.	[RedHat QE] Verify Bug OADP-1256 - Backup stays in progress status after restic pod is restarted due to OOM killed	Closed	Prasad Joshi
2.	[IBM QE-P] Verify Bug OADP-1256 - Backup stays in progress status after restic pod is restarted due to OOM killed	Release Pending	Sonia Garudi
3.	[IBM QE-Z] Verify Bug OADP-1256 - Backup stays in progress status after restic pod is restarted due to OOM killed	Release Pending	SHIVA SAI K (Inactive)

Assignee:: Tiger Kaovilai

Reporter:: Prasad Joshi

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2023/02/03 12:24 PM

Updated:: 2024/04/01 1:27 PM

Resolved:: 2023/03/09 1:25 AM

Details

Description

Description of problem:

Upstream PR: https://github.com/vmware-tanzu/velero/pull/4893

2. Create a restic backup

Actual results:

Attachments

Issue Links

Easy Agile Planning Poker

Sub-Tasks

Activity

People

Dates