Hybrid Cloud Console / RHCLOUD-45029

Kessel Self-SRE & Incident Response

    • Kessel Self-SRE Migration & Incident Response Implementation
    • Product / Portfolio Work
    • To Do
    • 100% To Do, 0% In Progress, 0% Done

      Review the CRCPLAN parent feature for additional context, including the feature overview, goals, user stories and use cases, acceptance criteria, designs, dependencies, risks, assumptions, pending questions and documentation callouts.

      Summary and goal

      This epic enables the Kessel team to take full operational ownership of its services under the Self-SRE initiative. The team is moving from a shared support model to a dedicated on-call structure.

      Migration guide: https://docs.google.com/document/d/11sF48rQh8IF5x6S0ehbY5KY43b33KIokhY4nbNdoa5I/edit?tab=t.0#heading=h.48gmm8qtpswy

      Acceptance Criteria 

      1. The Kessel team can self-service its cluster and infrastructure admin needs.
      2. The Kessel SM process is enhanced to use PagerDuty alerting during business hours.

      Checklist

      Checklist Item | Required | Notes or Comments
      Workstream or external team dependencies? | N |
      ADR required? (long-form for approval, short-form for information) | N |
      Testing plans (new automation or update existing?) | N |
      Known dependencies? (link to the dependent Jiras, add details) | N |

      Open Questions

      Capture any open questions and resolutions related to the epic goal or acceptance criteria. Add any additional details, questions or decisions that need to be made or addressed. 

      Assignee: Unassigned
      Reporter: Tyler Creller (rh-ee-tcreller)