-
Feature Request
-
Resolution: Unresolved
-
Normal
-
None
-
None
-
None
- Proposed title of this feature request
Ask to add support for SLOTH in ROSA HCP for user workloads
2. What is the nature and description of the request?
Enhancement to monitoring for user services/workloads
3. Why does the customer need this? (List the business requirements here)
Sloth is used to monitor the following Service Level Objectives. Each SLO (defined as http://sloth.slok.dev/v1/PrometheusServiceLevel) generates a PrometheusRule which defines a desired set of Prometheus alerting and/or recording rules. Alerts are actively viewed and handled by the pipeline on-call staff
- Latency requitements for Produce Requests (95 percentile under 50ms)
- Reliability of Kafka Requests (99.9 objective) using kafka broker metrics
- Reliability of Kafka Requests (99.9 objective) using kafka-monitor requests
- This SLO measures the availability of API requests at load balancer (envoy) level (99.5 objective).
- This SLO measures the latency for produce requests in API, measured at load balancer (envoy) level. (95 objective)
- This SLO measures the latency for produce requests in API, measured at load balancer (envoy) level (95 objective)
4. List any affected packages or components.