-
Task
-
Resolution: Unresolved
-
Normal
-
None
-
None
-
None
-
None
-
Product / Portfolio Work
-
8
-
False
-
-
False
-
Not Selected
-
NEW
-
NEW
-
-
-
MON Sprint 280
Support in-cluster monitoring components under the telemetry profile. The effort entails putting all such recording rules into dedicated SMs for each component. These are:
- '{__name__=~"cluster:usage:.*"}'
- '{__name__="count:up0"}'
- '{__name__="count:up1"}'
- '{__name__="ALERTS",alertstate="firing",severity=~"critical|warning|info|none"}'
- '{__name__="cluster:capacity_cpu_cores:sum"}'
- '{__name__="cluster:capacity_memory_bytes:sum"}'
- '{__name__="cluster:cpu_usage_cores:sum"}'
- '{__name__="cluster:memory_usage_bytes:sum"}'
- '{__name__="openshift:cpu_usage_cores:sum"}'
- '{__name__="openshift:memory_usage_bytes:sum"}'
- '{__name__="workload:cpu_usage_cores:sum"}'
- '{__name__="workload:memory_usage_bytes:sum"}'
- '{__name__="cluster:virt_platform_nodes:sum"}'
- '{__name__="cluster:node_instance_type_count:sum"}'
- '{__name__="node_role_os_version_machine:cpu_capacity_cores:sum"}'
- '{__name__="node_role_os_version_machine:cpu_capacity_sockets:sum"}'
- '{__name__="cluster:alertmanager_integrations:max"}'
- '{__name__="cluster:telemetry_selected_series:count"}'
- '{__name__="openshift:prometheus_tsdb_head_series:sum"}'
- '{__name__="openshift:prometheus_tsdb_head_samples_appended_total:sum"}'
- '{__name__="monitoring:container_memory_working_set_bytes:sum"}'
- '{__name__="namespace_job:scrape_series_added:topk3_sum1h"}'
- '{__name__="namespace_job:scrape_samples_post_metric_relabeling:topk3"}'
- '{__name__="profile:cluster_monitoring_operator_collection_profile:max"}'
- '{__name__="vendor_model:node_accelerator_cards:sum",vendor=~"NVIDIA|AMD|GAUDI|INTEL|QUALCOMM|Marvell|Mellanox"}'