[NETOBSERV-1483] Compare performance and resource footprints Direct vs Kafka - Red Hat Issue Tracker

Type: Story
Resolution: Done
Priority: Minor
Fix Version/s: None
Affects Version/s: None
Component/s: Kafka
Labels:
- QE

Story Points:
8
Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
qe-automation-1.7
Intelligence Requested:
Market:

Sprint:
NetObserv - Sprint 250, NetObserv - Sprint 251, NetObserv - Sprint 252, NetObserv - Sprint 253, NetObserv - Sprint 254, NetObserv - Sprint 255

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Our recommendation table here [1] shows configuration without Kafka on 10-nodes clusters and with Kafka on 25-nodes and above. Although there are various reasons to recommend using Kafka anyway, it would be good to have numbers to back this recommendation in terms of performance and resource consumption.

So, we should run some scale-test jobs for the mentioned cluster sizes (10, 25, 65 and 120 nodes), each with and without Kafka. This gives us the following runs / configs:

10 nodes, no Kafka
10 nodes, kafka, 6 replicas, 12 partitions
25 nodes, no kafka
25 nodes, kafka, 12 replicas, 24 partitions*
25 nodes, kafka, 24 replicas, 48 partitions
65 nodes, no kafka
65 nodes, kafka, 24 replicas, 48 partitions
120 nodes, no kafka
120 nodes, kafka, 24 replicas, 48 partitions

*: currently we recommend 24 replicas on 25-nodes cluster .. which perhaps is too much (almost 1 per node) ; I'd just like to verify if it's beneficial / how it compares with just 12 replicas for instance.

PS: I don't know which of the test script makes more sense to provide a realistic workload (ingress-perf? node-density? cluster-density?) - we need to get traffic distributed among a variety of different workloads (ie. involving different deployments), and I think cluster-density does that, but perhaps the others as well.

Goal: Depending on our finding, we may adapt our recommendation doc, and/or provide some precision, such as that we find xx% to yy% additional resource usage when using some mode, so that the users can make a more informed choice.

[1] https://docs.openshift.com/container-platform/4.14/network_observability/configuring-operator.html#network-observability-resources-table_network_observability

Assignee:: Mehul Modi

Reporter:: Joel Takvorian

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/02/06 4:21 PM

Updated:: 2024/06/24 1:33 PM

Resolved:: 2024/06/24 1:33 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates