-
Bug
-
Resolution: Unresolved
-
Critical
-
None
-
None
-
False
-
-
False
-
-
URL
DESCRIPTION
In the past, for example, in the example available in https://docs.openshift.com/container-platform/4.14/logging/config/cluster-logging-memory.html:
collection: logs: type: "fluentd" fluentd: resources: limits: memory: 736Mi requests: 3 cpu: 200m memory: 736Mi ... 3. Specify the CPU and memory limits and requests for the log collector as needed.
The examples given where given indicating limits and also saying "Specify the CPU and memory limits and requests for the log collector as needed."
In all the examples, the `limits` and `resources` are not set as:
spec: # ... collection: type: <log_collector_type> resources: {} tolerations: {}
And this is not a good example for being given as the collector could use a huge memory and cpu leading to bring down masters and worker nodes.
Then, it should be good to have good examples and with a good explanation as the one in https://docs.openshift.com/container-platform/4.14/logging/config/cluster-logging-memory.html, but also setting the cpu limit and with the explanation given below with words to "adjust" to the needs
collection: logs: type: "fluentd" fluentd: resources: limits: cpu: 1 memory: 736Mi requests: 3 cpu: 200m memory: 736Mi ... 3. Specify the CPU and memory limits and requests for the log collector as needed.
Having bad examples without limits leads to problems like: https://issues.redhat.com/browse/LOG-4536 , mostly when Vector works all in memory without buffering on disk.
NOTES
The "good examples" should be also adapted to use the new "style". In the past, it was reported in https://issues.redhat.com/browse/OBSDOCS-79 and it's closed.
In parallel, and as part of the modifying the examples, a big recommendation should be across of the collector configuration recommending always to set the limits. It was requested in https://issues.redhat.com/browse/LOG-4745 to set a default limit when not set by the admin, but it was rejected, then, when not a limit exists can happen the detailed in the bug https://issues.redhat.com/browse/LOG-4536 where the nodes were getting exhausted in memory all used by Vector impacting to the business loads
- is related to
-
LOG-4745 Set default limits for Vector if not user defined values
- Closed
- relates to
-
OBSDOCS-726 ClusterLogging note for Multi ClusterLogForwarder feature
- In Progress
-
OBSDOCS-79 Logging: Update CLusterLogging config
- Closed
- links to