- Task
- Resolution: Done
- Major
As an RH associate, I want to observe OpenShift Logging Log Store metrics across the Elasticsearch fleet, so that I can inform my BU, eng team, etc. about how features are used, which problem areas exist, and how big the clusters are.
Acceptance criteria
- Feature usage in the Elasticsearch Custom Resource: expose metrics from the Elasticsearch Operator (EO) on which features are used (e.g. persistent vs. ephemeral storage, multi-node clusters with more than 3 data nodes, types of redundancy used)
- Problem areas: index management policies, retention per policy (app, infra, audit), resource requests and limits that are not equal
- Sizing metrics from ES metrics: telemetry on active indices/shards
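To make the first two criteria concrete, here is a minimal sketch of how a handful of low-cardinality facts could be derived from an Elasticsearch CR and rendered as Prometheus-style sample lines. This is an illustration only, not how EO is implemented: the metric names (`eo_cr_*`) are hypothetical, the CR is treated as a plain dict whose field names are loosely modeled on the Elasticsearch CR, and the rules "non-empty storage stanza means persistent" and "resources are read from nodeSpec only" are simplifying assumptions.

```python
from typing import Any


def summarize_cr(cr: dict[str, Any]) -> dict[str, Any]:
    """Reduce a CR-shaped dict to a few telemetry-friendly facts."""
    spec = cr.get("spec", {})
    nodes = spec.get("nodes", [])
    # Count only node groups that carry the "data" role.
    data_nodes = sum(
        n.get("nodeCount", 0) for n in nodes if "data" in n.get("roles", [])
    )
    # Assumption: a non-empty "storage" stanza means persistent storage.
    persistent = any(n.get("storage") for n in nodes)
    # Assumption: only the shared nodeSpec resources are inspected.
    res = spec.get("nodeSpec", {}).get("resources", {})
    mismatch = res.get("requests", {}) != res.get("limits", {})
    policies = spec.get("indexManagement", {}).get("policies", [])
    retention = {
        p["name"]: p.get("phases", {}).get("delete", {}).get("minAge", "unset")
        for p in policies
    }
    return {
        "storage": "persistent" if persistent else "ephemeral",
        "redundancy": spec.get("redundancyPolicy", "unset"),
        "data_nodes": data_nodes,
        "requests_limits_mismatch": mismatch,
        "retention": retention,
    }


def to_exposition(s: dict[str, Any]) -> list[str]:
    """Render the summary as sample lines; all metric names are hypothetical."""
    lines = [
        f'eo_cr_info{{storage="{s["storage"]}",redundancy="{s["redundancy"]}"}} 1',
        f'eo_cr_data_nodes {s["data_nodes"]}',
        f'eo_cr_requests_limits_mismatch {int(s["requests_limits_mismatch"])}',
    ]
    for name, min_age in sorted(s["retention"].items()):
        lines.append(f'eo_cr_retention_info{{policy="{name}",min_age="{min_age}"}} 1')
    return lines


# Made-up example CR, not taken from a real cluster.
example = {
    "spec": {
        "redundancyPolicy": "SingleRedundancy",
        "nodeSpec": {
            "resources": {
                "requests": {"memory": "8Gi"},
                "limits": {"memory": "16Gi"},
            }
        },
        "nodes": [
            {"roles": ["client", "data", "master"], "nodeCount": 3,
             "storage": {"size": "200G"}},
            {"roles": ["client", "data"], "nodeCount": 2,
             "storage": {"size": "200G"}},
        ],
        "indexManagement": {
            "policies": [
                {"name": "app-policy", "phases": {"delete": {"minAge": "7d"}}},
                {"name": "audit-policy", "phases": {"delete": {"minAge": "14d"}}},
            ]
        },
    }
}

if __name__ == "__main__":
    for line in to_exposition(summarize_cr(example)):
        print(line)
```

Keeping values such as the redundancy policy and storage type as labels on a constant-value info metric keeps cardinality low, which matters when the data is aggregated across a fleet. A real implementation would more likely live in the operator itself and register metrics through a Prometheus client library.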
Dev Notes
- The outcome is a Google Doc describing 5-10 metrics in total for the three acceptance criteria.
- The list does not need to be conclusive or complete.
- A similar implementation exists for Loki, covering CR metrics only: https://github.com/ViaQ/loki-operator/pull/32 (take this as inspiration only)