Type: Story
Resolution: Done
Priority: Major
Fix Version/s: Logging 6.0.0
Affects Version/s: None
Component/s: Log Collection
Labels:
- groomed

Story Points:
3
Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
Log Collection 6.0 Tech Debt
Docs QE Status:
NEW
QE Status:
NEW
Release Note Text:

Hide
This feature introduces an alert to trigger when log collector's are buffering logs to a cluster node's file system and the buffers are consuming more then 15% of the available space. This is a possible indicator of the log collectors experiencing back pressure from their configured outputs and that administrators should take action to keep the collectors from potentially destabilizing the cluster node

Show
This feature introduces an alert to trigger when log collector's are buffering logs to a cluster node's file system and the buffers are consuming more then 15% of the available space. This is a possible indicator of the log collectors experiencing back pressure from their configured outputs and that administrators should take action to keep the collectors from potentially destabilizing the cluster node
Release Note Type:
Feature
Intelligence Requested:
Market:

Sprint:
Log Collection - Sprint 252, Log Collection - Sprint 253

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Summary

Add Alerts that should be fired in case collectors potentially consuming too much node disk space.
Add Metrics Dashboard that will be show consuming space Vector Output Buffer on each node.

Acceptance Criteria

1. Implement Alerting:

Set up alerting rules to trigger alerts when the space consumed by the Vector Output Buffer exceeds 15% of the total disk space on each node.
Test the alerting system to ensure that alerts are fired appropriately when the criteria are met.

2. Create Grafana Dashboards:

Panel 1: Display the absolute size of the Vector buffer via a graph by instance.
Panel 2: Display the percentage of buffer size relative to the total disk space on the node.

Configure the panels in each dashboard to visualize the required metrics accurately.

3. Update Documentation

Notes

We must understand if it is possible for these alerts to be enabled for non-infra namespaces and how to do that. I believe there is no way for non-infra namespaces to be opted into cluster metrics

is cloned by

LOG-5586 Implement Alerts and Metrics Dashboard for Vector Output Buffer

Closed

is related to

LOG-5289 Investigate Output Buffer Alerts and Metrics

Closed

relates to

OBSDOCS-1139 Implement Alerts and Metrics Dashboard for Vector Output Buffer

Closed

links to

openshift/cluster-logging-operator#2438: LOG-5381: add alerts metric and dashboard panels

openshift/cluster-logging-operator#2462: [release-5.9] LOG-5381: add alerts metric and dashboard panels

openshift/cluster-logging-operator#2463: LOG-5381: fix bundle to include alert

openshift/cluster-logging-operator#2502: [release-5.9] LOG-5586: add alerts metric and dashboard panels

openshift/openshift-docs#82065: OBSDOCS-1101: Logging 6.0 Release Notes

RHBA-2024:137361 Logging for Red Hat OpenShift - 6.0.0

mentioned on

Merge request - Updated 2 upstream sources

Merge request - Updated 3 upstream sources

Merge request - Updated US source to: 0b4f2e6 LOG-4990: Initial impl of CLF.observability API

Merge request - Updated US source to: 0f2e265 Fix NPE is output type spec not declared

Merge request - Updated US source to: 02fb449 Fix NPE if inputs spec not declared

Merge request - Updated US source to: 4eb0ca5 LOG-5474: Fix CLO rotate_wait_ms to rotate_wait_secs

Merge request - Updated US source to: 6ff3445 LOG-5515: Refactor syslog to use OBS API

Merge request - Updated US source to: 07e42da LOG-5367: Enable generator unit tests

Merge request - Updated US source to: 9a06d9b LOG-5569: Add managementState to obs API

Merge request - Updated US source to: 9d1938d LOG-5601: Stub migration and validation into controller

Merge request - Updated US source to: 37feb80 LOG-5381: add alerts metric and dashboard panels

Merge request - Updated US source to: 38a8414 Fix build issue in midstream CLO

Merge request - Updated US source to: 57c0a64 LOG-5381: fix bundle to include alert. fix threshold value

Merge request - Updated US source to: 79e2c14 Address CVE-2023-45288

Merge request - Updated US source to: 131c034 LOG-5515: Removed unused TLS templates

Merge request - Updated US source to: 981ec0c LOG-5621: Add pipeline validations

Merge request - Updated US source to: 1977791 LOG-5131: Remove old telemetry implementation from development branch

Merge request - Updated US source to: ae3452e LOG-5605: Add input secret validation

Merge request - Updated US source to: b8fe0df LOG-5515: Refactor splunk to OBS API

Merge request - Updated US source to: be906d2 LOG-5515: Refactor azuremonitor output to obs

Merge request - Updated US source to: c6c896d LOG-5381: add alerts metric and dashboard panels

Merge request - Updated US source to: d7a9dfc LOG-5527: refactor infra, audit, receiver inputs to OBS API

Merge request - Updated US source to: ee2fcc9 LOG-5558: Add support for tls from configmap

Merge request - Updated US source to: f9c30d9 LOG-2811: Add lokistack output type to cluster forwarder

(4 links to, 24 mentioned on)

Errata Tool added a comment - 2024/09/24 3:25 PM

Since the problem described in this issue should be resolved in a recent advisory, it has been closed.

For information on the advisory (Logging for Red Hat OpenShift - 6.0.0), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2024:6693

Errata Tool added a comment - 2024/09/24 3:25 PM Since the problem described in this issue should be resolved in a recent advisory, it has been closed. For information on the advisory (Logging for Red Hat OpenShift - 6.0.0), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2024:6693

GitLab CEE Bot added a comment - 2024/06/22 3:45 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_01bfca2d8cacccf01239fb09e7112e9d:

Updated 3 upstream sources

GitLab CEE Bot added a comment - 2024/06/22 3:45 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _01bfca2d8cacccf01239fb09e7112e9d : Updated 3 upstream sources

GitLab CEE Bot added a comment - 2024/06/19 3:45 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_ad90d8fb3b89a0c7e4d601840eccddaa:

Updated 2 upstream sources

GitLab CEE Bot added a comment - 2024/06/19 3:45 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _ad90d8fb3b89a0c7e4d601840eccddaa : Updated 2 upstream sources

GitLab CEE Bot added a comment - 2024/06/11 3:44 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_1ddb486521e9f822fe28906aa13c853c:

Updated US source to: 981ec0c ~~LOG-5621~~: Add pipeline validations

GitLab CEE Bot added a comment - 2024/06/11 3:44 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _1ddb486521e9f822fe28906aa13c853c : Updated US source to: 981ec0c LOG-5621 : Add pipeline validations

GitLab CEE Bot added a comment - 2024/06/04 3:44 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_d6a0258de348d197c69f857a2f67bf12:

Updated US source to: ae3452e ~~LOG-5605~~: Add input secret validation

GitLab CEE Bot added a comment - 2024/06/04 3:44 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _d6a0258de348d197c69f857a2f67bf12 : Updated US source to: ae3452e LOG-5605 : Add input secret validation

GitLab CEE Bot added a comment - 2024/06/01 3:44 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_6e600b06a4ee53e863c48be2ecacda67:

Updated US source to: 9d1938d ~~LOG-5601~~: Stub migration and validation into controller

GitLab CEE Bot added a comment - 2024/06/01 3:44 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _6e600b06a4ee53e863c48be2ecacda67 : Updated US source to: 9d1938d LOG-5601 : Stub migration and validation into controller

GitLab CEE Bot added a comment - 2024/05/31 7:39 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-5.9-rhel-9_upstream_4fc12aa7c355b54f83dc063f0b42b363:

Updated US source to: 37feb80 ~~LOG-5381~~: add alerts metric and dashboard panels

GitLab CEE Bot added a comment - 2024/05/31 7:39 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-5.9-rhel-9_ upstream _4fc12aa7c355b54f83dc063f0b42b363 : Updated US source to: 37feb80 LOG-5381 : add alerts metric and dashboard panels

GitLab CEE Bot added a comment - 2024/05/30 4:09 PM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_a5cef1cc78682ab8fe6f23270c9412cb:

Updated US source to: 02fb449 Fix NPE if inputs spec not declared

GitLab CEE Bot added a comment - 2024/05/30 4:09 PM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _a5cef1cc78682ab8fe6f23270c9412cb : Updated US source to: 02fb449 Fix NPE if inputs spec not declared

GitLab CEE Bot added a comment - 2024/05/30 9:14 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_a96f0657fd44c2e14a4fcd3618dc6470:

Updated US source to: 0f2e265 Fix NPE is output type spec not declared

GitLab CEE Bot added a comment - 2024/05/30 9:14 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _a96f0657fd44c2e14a4fcd3618dc6470 : Updated US source to: 0f2e265 Fix NPE is output type spec not declared

GitLab CEE Bot added a comment - 2024/05/29 3:43 AM

CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_upstream_d7ac97e0b6e0dc061202f5953efe63cc:

Updated US source to: 1977791 ~~LOG-5131~~: Remove old telemetry implementation from development branch

GitLab CEE Bot added a comment - 2024/05/29 3:43 AM CPaaS Service Account mentioned this issue in a merge request of openshift-logging / Log Collection Midstream on branch openshift-logging-6.0-rhel-9_ upstream _d7ac97e0b6e0dc061202f5953efe63cc : Updated US source to: 1977791 LOG-5131 : Remove old telemetry implementation from development branch

Assignee:: Vitalii Parfonov

Reporter:: Vitalii Parfonov

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/04/15 10:29 AM

Updated:: 2024/09/24 3:25 PM

Resolved:: 2024/05/03 8:48 AM

Details

Description

Summary

Acceptance Criteria

Notes

Attachments

Issue Links

Easy Agile Planning Poker

Activity

[LOG-5381] Implement Alerts and Metrics Dashboard for Vector Output Buffer

Collapse comment: Errata Tool added a comment - 2024/09/24 3:25 PM

Expand comment: Errata Tool added a comment - 2024/09/24 3:25 PM

Collapse comment: GitLab CEE Bot added a comment - 2024/06/22 3:45 AM

Expand comment: GitLab CEE Bot added a comment - 2024/06/22 3:45 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/06/19 3:45 AM

Expand comment: GitLab CEE Bot added a comment - 2024/06/19 3:45 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/06/11 3:44 AM

Expand comment: GitLab CEE Bot added a comment - 2024/06/11 3:44 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/06/04 3:44 AM

Expand comment: GitLab CEE Bot added a comment - 2024/06/04 3:44 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/06/01 3:44 AM

Expand comment: GitLab CEE Bot added a comment - 2024/06/01 3:44 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/05/31 7:39 AM

Expand comment: GitLab CEE Bot added a comment - 2024/05/31 7:39 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/05/30 4:09 PM

Expand comment: GitLab CEE Bot added a comment - 2024/05/30 4:09 PM

Collapse comment: GitLab CEE Bot added a comment - 2024/05/30 9:14 AM

Expand comment: GitLab CEE Bot added a comment - 2024/05/30 9:14 AM

Collapse comment: GitLab CEE Bot added a comment - 2024/05/29 3:43 AM

Expand comment: GitLab CEE Bot added a comment - 2024/05/29 3:43 AM

People

Dates