Type: Feature
Resolution: Obsolete
Priority: Major
Fix Version/s: None
Affects Version/s: Logging 5.7, Logging 5.6, Logging 5.8
Component/s: Log Collection, PM Logging
Labels:
- cee.neXT

Blocked:
False
Blocked Reason:
None
Ready:
False
Color Status:
Not Selected

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

PX Priority Data:
PX Impact Score:
PX Review Complete:

Intelligence Requested:
Market:

Proposed title of this feature request
Vector and fluentd comparative
What is the nature and description of the request?

Currently, Vector is GA as the collector and Fluentd is deprecated. However, it is not clear how to perform the migration, what its impact is (logs not yet compressed on the node are reprocessed), or what the pros, cons, and comparative performance are.

This lack of clarity about the change, and about how Fluentd and Vector each work (Vector currently buffers in memory), does not help the adoption of Vector or the migration from Fluentd to Vector. A comparison between the two would therefore be very helpful.

Some points that are usually requested are:

Vector and Fluentd comparison
  - On features (present in the current Logging documentation) [1]
  - Simulate the same load for Fluentd and Vector and share, for both:
    + memory and CPU usage
    + number of events dropped
    + number of events read per second
    + time to ingest the load
  - Vector works in memory out of the box, whereas Fluentd uses disk buffering, so:
    + reliability vs performance
    + Vector memory usage can grow under backpressure when delivering the logs
  - Define how the outputs work by default
    + Retry (what is retried and what is not)
    + drop or block
    + Vector keeps 500 events per output in memory; Fluentd buffers to disk, 8GB per output by default
    + Vector adaptive concurrency

Also indicate, for the migration, that all logs not yet compressed will be reprocessed by Vector, which can lead to:
  - duplicated logs at the time of the migration
  - 429 Too Many Requests responses from the log storage receiving the logs, or reaching its rate limit
  - disk and performance problems on the log store as a consequence of the collector re-reading and processing all the old logs
  - impact on the Kube API
  - a peak of memory and CPU usage in Vector until all the old logs are processed (these logs can be several GB per node). This could also have a big impact. For instance, an example of the impact:

+ LogMaxSize: 100M, with 2 uncompressed logs per pod
+ Node with 100 pods

GB to be read immediately:
  - Per pods: 100MB x 2 x 100 = 20GB for this node for pods
  - + journal logs
  - + audit logs from node
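
The arithmetic above can be written as a small helper to try other values. This is only a sketch: the function name and parameters are hypothetical, it uses decimal units (1 GB = 1000 MB) to match the figure above, and it covers pod logs only, not journal or audit logs.

```python
def estimate_pod_log_gb(log_max_size_mb: float,
                        uncompressed_files_per_pod: int,
                        pods_per_node: int) -> float:
    """Rough upper bound, in decimal GB, of pod log data re-read on one node.

    Assumes every uncompressed file has grown to the full LogMaxSize,
    which is the worst case used in the example above.
    """
    total_mb = log_max_size_mb * uncompressed_files_per_pod * pods_per_node
    return total_mb / 1000


# The example from this request: LogMaxSize 100M, 2 files per pod, 100 pods.
print(estimate_pod_log_gb(100, 2, 100))  # 20.0
```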

Why does the customer need this? (List the business requirements)

Guidance on the migration from Fluentd to Vector, and easier adoption of Vector through an explanation of how it works and its pros/cons.

List any affected packages or components.

Collector: Vector

Additional information

It would be good to have a tool/script/steps to estimate how many GB from Journald, Audit, infrastructure and Applications will be read at the time of the migration, to better understand the impact and, with that, schedule the migration from Fluentd to Vector in the best business window with low load.
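
As a starting point for such a tool, the sketch below sums the size of files that are not yet compressed under a few log directories on a node. It is not an existing tool. The directory paths, the `.gz` suffix used to detect compressed files, and all names in the code are assumptions to adjust for the cluster. Pod logs are not split into infrastructure and Applications here, and the on-disk size of the journal is only an approximation of what the collector would read.

```python
import os


def uncompressed_bytes(root, compressed_suffixes=(".gz",)):
    """Sum the sizes of files under root that do not look compressed."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(compressed_suffixes):
                continue  # already rotated and compressed: skipped in this estimate
            try:
                total += os.path.getsize(os.path.join(dirpath, name))
            except OSError:
                pass  # file rotated or removed while scanning
    return total


# Assumed locations on the node; adjust to the actual layout.
ASSUMED_SOURCES = {
    "pods (infrastructure + applications)": "/var/log/pods",
    "journal": "/var/log/journal",
    "audit": "/var/log/audit",
}


def report(sources):
    """Return {label: size in decimal GB} for each source directory."""
    return {label: uncompressed_bytes(path) / 1e9
            for label, path in sources.items()}


if __name__ == "__main__":
    for label, gb in report(ASSUMED_SOURCES).items():
        print(f"{label}: {gb:.2f} GB")
```

Run on each node with enough privileges to read the log directories, the per-node totals give a rough upper bound to use when choosing the migration window.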

links to

[KCS] Migrating the log collector from fluentd to vector in RHOCP 4

openshift/openshift-docs#80999: Release branch preview gen.

openshift/openshift-docs#93577: OBSDOCS-1701: Upgrading to Logging 6 steps Final

Assignee: Jamie Parker

Reporter: Oscar Casal Sanchez

Votes: 3

Watchers: 14

Created: 2023/10/26 9:29 AM

Updated: 2025/09/14 12:57 AM

Resolved: 2025/07/31 5:03 PM
