Type: Story
Resolution: Unresolved
Priority: Undefined
Fix Version/s: OADP 1.6.0
Affects Version/s: None
Component/s: None
Labels:
- oadp_upstream_milestone_v1.18
- triaged

Activity Type: Product / Portfolio Work
Story Points: 3
Blocked: False
Blocked Reason: None
Ready: False
Color Status: Not Selected
QEStatus: ToDo
Risk Probability: Very Likely
Risk Score: 0
Workstream: None
Root Cause: Unset
Failure Category: Unknown
Regression: None

This issue tracks upstream Velero GitHub issue #9225, which is part of the Velero v1.18 milestone.

Description

I would like a better way to keep track of maintenance Job successes and failures.

Currently, the way to do this is to check the status of the retained Job objects, where the number of Jobs kept is set by --keep-latest-maintenance-jobs. This matters because maintenance job failures will eventually cause severe performance degradation of backups over time, even while kopia-based backups continue to succeed.

This argument is going away in Velero 1.17. I would like to propose an alternative solution.

The proposal is to publish Prometheus metrics as a way of keeping track of maintenance job successes and failures.

In addition, some cloud providers offer software that can trigger emails and other alerts based on Prometheus metrics.

Prometheus metrics are already published for dataupload/datadownload.

Values of interest:

- success, in the context of which maintenance job corresponds to which BackupRepository
- failure, in the same context as success
- execution time of the Job
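
To make the three values above concrete, here is a minimal sketch, in Python with only the standard library, of how per-repository success, failure, and execution-time samples could be recorded and rendered in the Prometheus text exposition format. The metric names, the `repository` label, and the `MaintenanceMetrics` class are all hypothetical and chosen for illustration; the upstream issue does not define them, and this is not Velero's implementation.

```python
from collections import defaultdict

# Hypothetical metric names; the upstream issue does not specify any.
SUCCESS = "repo_maintenance_success_total"
FAILURE = "repo_maintenance_failure_total"
DURATION = "repo_maintenance_duration_seconds"


class MaintenanceMetrics:
    """In-memory maintenance job metrics keyed by BackupRepository name."""

    def __init__(self):
        self.counts = defaultdict(int)          # (metric, repository) -> count
        self.duration_sum = defaultdict(float)  # repository -> total seconds
        self.duration_count = defaultdict(int)  # repository -> observed jobs

    def record(self, repository, succeeded, seconds):
        """Record one finished maintenance job for the given repository."""
        metric = SUCCESS if succeeded else FAILURE
        self.counts[(metric, repository)] += 1
        self.duration_sum[repository] += seconds
        self.duration_count[repository] += 1

    def render(self):
        """Return the samples as Prometheus text exposition lines."""
        lines = []
        for (metric, repo), value in sorted(self.counts.items()):
            lines.append(f'{metric}{{repository="{repo}"}} {value}')
        for repo in sorted(self.duration_sum):
            lines.append(
                f'{DURATION}_sum{{repository="{repo}"}} {self.duration_sum[repo]}'
            )
            lines.append(
                f'{DURATION}_count{{repository="{repo}"}} {self.duration_count[repo]}'
            )
        return "\n".join(lines) + "\n"


# Example usage with a made-up repository name.
metrics = MaintenanceMetrics()
metrics.record("example-repo", succeeded=True, seconds=12.5)
metrics.record("example-repo", succeeded=False, seconds=3.0)
print(metrics.render())
```

Labeling each sample with the repository keeps the success and failure counts tied to a specific BackupRepository, which is the correspondence the list asks for. An alert rule could then fire when the failure counter for any repository increases.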

Upstream Details

GitHub Issue: https://github.com/vmware-tanzu/velero/issues/9225
Status: Open
Assignee: shubham-pampattiwar
Labels: Metrics
Created: 2025-09-04T15:40:36Z
Updated: 2025-09-19T06:23:42Z

This addition makes sense in light of the upcoming Velero 1.17 changes.

Assignee: Shubham Pampattiwar
Reporter: Wes Hayutin
Votes: 0
Watchers: 1
Created: 2025/09/24 5:14 PM
Updated: 2026/01/05 8:06 PM
