-
Spike
-
Resolution: Done
-
Blocker
-
Logging 6.0.0, Logging 5.9.z
-
3
-
False
-
None
-
False
-
NEW
-
NEW
-
Release Note Not Required
-
Log Collection - Sprint 251, Log Collection - Sprint 252, Log Collection - Sprint 253
Summary
Investigate options for addressing collectors potentially consuming too much node disk
Acceptance Criteria
- Written plan of options
- Agreement on a path forward by the logging team
- Assessment of the feasibility of adding a solution to the 5.9.z stream
- Cards created for implementing a solution
Notes
- The Fluentd singleton introduced a 15% cap, evaluated at runtime, to address a 3.x bug in which the collector consumed too much node disk and caused the API node to crash
- The OCP 4.x world with multiple CLFs is more difficult: nodes can vary in size, and multiple CLFs can compete for the same aggregate total disk, so the calculation cannot be made at runtime
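To make the multi-CLF difficulty concrete, here is a minimal sketch of the arithmetic, assuming an aggregate cap were split evenly across forwarders. The function name, its parameters, and the even-split policy are hypothetical illustrations, not behavior of the operator or collector; the 15% default is borrowed from the Fluentd note above purely as an example.

```python
def per_forwarder_budget_bytes(
    node_disk_bytes: int,
    forwarder_count: int,
    cap_percent: int = 15,  # assumption: reuses the Fluentd-era figure for illustration
) -> int:
    """Split an aggregate buffer cap evenly across the forwarders on one node.

    Hypothetical helper: integer math only, so results are exact.
    """
    if forwarder_count < 1:
        raise ValueError("forwarder_count must be at least 1")
    if node_disk_bytes < 0 or not 0 <= cap_percent <= 100:
        raise ValueError("node_disk_bytes must be >= 0 and cap_percent in 0..100")
    aggregate_cap = node_disk_bytes * cap_percent // 100
    return aggregate_cap // forwarder_count


GIB = 2**30
# The same cap yields very different per-forwarder budgets depending on
# node size and on how many forwarders share the node.
small_node = per_forwarder_budget_bytes(100 * GIB, forwarder_count=3)
large_node = per_forwarder_budget_bytes(1000 * GIB, forwarder_count=1)
```

Both inputs (node disk size and the number of forwarders sharing it) differ per node and can change after deployment, which is why a single value fixed in advance does not fit every node.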
Initial Option Ideas
- Alert that triggers when disk buffer usage exceeds a given percentage of a node's available disk
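The alert condition could be sketched roughly as follows. This is only an illustration of the comparison, assuming per-forwarder buffer sizes and the node's available disk are already known; the function name, its inputs, and the threshold value used in the example are hypothetical and do not correspond to any existing metric or rule.

```python
def buffer_alert_should_fire(
    buffer_bytes_by_forwarder: dict[str, int],
    node_available_bytes: int,
    threshold_percent: int,
) -> bool:
    """Return True when the combined buffer usage of all forwarders on a node
    exceeds threshold_percent of that node's available disk.

    Hypothetical check: usage is summed across forwarders because they
    compete for the same disk.
    """
    if node_available_bytes < 0:
        raise ValueError("node_available_bytes must be >= 0")
    total_buffer_bytes = sum(buffer_bytes_by_forwarder.values())
    # Cross-multiply instead of dividing to stay in exact integer arithmetic.
    return total_buffer_bytes * 100 > node_available_bytes * threshold_percent


# Example with made-up numbers: two forwarders on a node with 100 units available.
fires = buffer_alert_should_fire({"clf-a": 10, "clf-b": 6}, 100, threshold_percent=15)
```

Evaluating the sum per node, rather than per forwarder, means the alert still fires when no single forwarder is large but together they cross the threshold.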
- is documented by: OBSDOCS-1134 Investigate Output Buffer Alerts and Metrics (Closed)
- relates to: LOG-5381 Implement Alerts and Metrics Dashboard for Vector Output Buffer (Closed)