Type: Bug
Resolution: Unresolved
Priority: Critical
Fix Version/s: None
Affects Version/s: Logging 5.8, Logging 5.9, Logging 6.0, Logging 6.1
Component/s: Logging
Labels:
- cee.next
- content-plan-potential

Blocked:
False
Blocked Reason: None
Ready:
False
Intelligence Requested:
Market:

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

URL

https://docs.openshift.com/container-platform/4.14/logging/cluster-logging-deploying.html#create-cluster-logging-cli_cluster-logging-deploying

https://docs.openshift.com/container-platform/4.14/logging/log_collection_forwarding/log-forwarding.html#about-log-collectors-types_log-forwarding

https://docs.openshift.com/container-platform/4.14/logging/log_collection_forwarding/cluster-logging-collector.html#configuring-logging-collector_cluster-logging-collector

DESCRIPTION

In the past, the documentation examples set resource limits, for example the one available in https://docs.openshift.com/container-platform/4.14/logging/config/cluster-logging-memory.html:

  collection:
    logs:
      type: "fluentd"
      fluentd:
        resources:
          limits:
            memory: 736Mi
          requests: # (3)
            cpu: 200m
            memory: 736Mi
...
3. Specify the CPU and memory limits and requests for the log collector as needed.

Those examples were given with limits set, along with the instruction "Specify the CPU and memory limits and requests for the log collector as needed."

In all the examples at the URLs listed above, `resources` is left empty, so neither limits nor requests are set:

spec:
# ...
  collection:
    type: <log_collector_type> 
    resources: {}
    tolerations: {}

This is not a good example to give, because the collector could consume a large amount of memory and CPU, bringing down master and worker nodes.

It would therefore be better to provide examples with a clear explanation, like the one in https://docs.openshift.com/container-platform/4.14/logging/config/cluster-logging-memory.html, that also set the CPU limit and keep the explanation shown below, worded so that readers "adjust" the values to their needs:

  collection:
    logs:
      type: "fluentd"
      fluentd:
        resources:
          limits:
            cpu: 1
            memory: 736Mi
          requests: # (3)
            cpu: 200m
            memory: 736Mi
...
3. Specify the CPU and memory limits and requests for the log collector as needed.

Examples without limits lead to problems like https://issues.redhat.com/browse/LOG-4536, especially when Vector works entirely in memory without buffering on disk.

NOTES

The "good examples" should also be adapted to use the new "style". This was previously reported in https://issues.redhat.com/browse/OBSDOCS-79, which is now closed.
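
As a rough sketch of what such an example could look like in the new style, where `resources` sits directly under `spec.collection` as in the snippet in the DESCRIPTION, the fragment below reuses the values from the Fluentd example above. The `vector` collector type and every value shown are illustrative assumptions, not tested or recommended defaults; they would need to be adjusted to the needs of each cluster.

```yaml
# Illustrative sketch only: values are copied from the Fluentd example in this
# issue and are not tested recommendations. Adjust them to the cluster's needs.
spec:
# ...
  collection:
    type: vector          # assumption: Vector as the log collector type
    resources:
      limits:
        cpu: 1            # CPU limit, as proposed in this issue
        memory: 736Mi
      requests:
        cpu: 200m
        memory: 736Mi
```

The point of the sketch is only that both `limits` and `requests` are populated instead of `resources: {}`, so the collector cannot grow without bound on a node.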

In parallel, as part of modifying the examples, the collector configuration documentation should include a prominent recommendation to always set limits. https://issues.redhat.com/browse/LOG-4745 requested a default limit for when the admin does not set one, but it was rejected. As a result, when no limit exists, the situation described in https://issues.redhat.com/browse/LOG-4536 can occur: the nodes ran out of memory, all of it consumed by Vector, impacting business workloads.

is related to

LOG-4745 Set default limits for Vector if not user defined values (Closed)

relates to

OBSDOCS-726 ClusterLogging note for Multi ClusterLogForwarder feature (Backlog)

OBSDOCS-79 Logging: Update CLusterLogging config (Closed)

links to

openshift/openshift-docs#77407: OBSDOCS-736-4.14 - Logging Installation Fix - cee.next

Assignee: Unassigned

Reporter: Oscar Casal Sanchez

Votes: 1

Watchers: 3

Created: 2024/02/23 6:09 PM

Updated: 2025/01/30 7:40 PM
