- Feature
- Resolution: Unresolved
- Major
- Strategic Product Work
- OCPSTRAT-1408 Comprehensive Automation of HCP Backup and Restore Processes for Enhanced Scalability and Reliability
- 100% To Do, 0% In Progress, 0% Done
Needs TE because this is a new feature that will help increase cluster resiliency.
-
Feature Overview (Goal Summary)
This feature continues the work done in https://issues.redhat.com/browse/OCPSTRAT-989, which introduced the OADP integration for backup and restore. The idea is to add a HyperShift OADP plugin on top of the existing integration to automate the capture of HyperShift resources (e.g., HostedCluster and NodePool).
Goals (Expected User Outcomes)
- Primary User Persona: Cluster Service Providers
- User Benefit: Enhanced data protection and quicker disaster recovery for Hosted Clusters through automated etcd snapshots.
Requirements (Acceptance Criteria)
- Automatic Snapshot Creation: Etcd snapshots must be taken automatically at regular intervals.
- S3 Storage: Support for any S3-compatible storage for snapshot storage.
- Snapshot Rotation and Retention Policy: Snapshots are rotated/removed after a specified period to manage storage efficiently.
- Restoration SOP: Standard Operating Procedures for etcd restoration should be established, targeting a recovery time objective (RTO) of roughly 1 hour at most. Preferably, restoration is automated as well.
- Metrics: Track Mean Time to Recovery (MTTR) to improve reliability. Open question: do we already have metrics for this?
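To make the retention and MTTR requirements above more concrete, here is a minimal sketch in Python. It is not the plugin's implementation and calls no OADP or S3 API. The `Snapshot` type, the function names, the object keys, and the `keep_min` safeguard (always keep the newest snapshot, even if it is past retention) are all hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class Snapshot:
    key: str            # hypothetical object key in the S3-compatible bucket
    taken_at: datetime  # when the etcd snapshot was taken


def expired_snapshots(snapshots, now, retention, keep_min=1):
    """Return snapshots older than `retention`, never touching the newest `keep_min`."""
    newest_first = sorted(snapshots, key=lambda s: s.taken_at, reverse=True)
    protected = {s.key for s in newest_first[:keep_min]}
    return [
        s for s in newest_first
        if s.key not in protected and now - s.taken_at > retention
    ]


def mean_time_to_recovery(incidents):
    """Average of (restored_at - failed_at) over a non-empty list of incidents."""
    if not incidents:
        raise ValueError("no incidents recorded")
    total = sum((restored - failed for failed, restored in incidents), timedelta())
    return total / len(incidents)


# Example: a 7-day retention window evaluated on 2024-01-10.
utc = timezone.utc
now = datetime(2024, 1, 10, tzinfo=utc)
snaps = [
    Snapshot("etcd-0101", datetime(2024, 1, 1, tzinfo=utc)),
    Snapshot("etcd-0108", datetime(2024, 1, 8, tzinfo=utc)),
    Snapshot("etcd-0109", datetime(2024, 1, 9, tzinfo=utc)),
]
to_delete = expired_snapshots(snaps, now, retention=timedelta(days=7))

# Example: two recoveries taking 40 and 60 minutes, compared with a 1-hour RTO.
incidents = [
    (datetime(2024, 1, 2, 10, 0, tzinfo=utc), datetime(2024, 1, 2, 10, 40, tzinfo=utc)),
    (datetime(2024, 1, 5, 12, 0, tzinfo=utc), datetime(2024, 1, 5, 13, 0, tzinfo=utc)),
]
mttr = mean_time_to_recovery(incidents)
within_rto = mttr <= timedelta(hours=1)
```

In this example only `etcd-0101` falls outside the retention window, and the MTTR is 50 minutes, which is within the 1-hour target. The `keep_min` safeguard is a design choice of the sketch, not a stated requirement: it prevents rotation from deleting the last usable snapshot if scheduled snapshots stop being taken.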
- clones: OCPSTRAT-989 Backup/restore for Hosted Clusters for Self-Managed HCP Part I (Closed)