Type: Spike
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: OVN Kubernetes
Labels:
- OVN-Kubernetes

Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
[SDN Backlog] Network Tooling

Cost of Delay:
0
WSJF:
0

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

We need to understand under what conditions the metric ovs_vswitchd_dp_flows_lookup_lost may increment.

Currently, we understand that if ovs-vswitchd is CPU-starved and incoming packets are large, this metric is more likely to increment.

A test needs to be conducted:

1. Provision a worker node instance with the smallest CPU allocation possible for an OCP node on a cloud provider.
2. Fill the node with CPU-intensive workloads.
3. Begin sending jumbo frames to the node (up to 9k?). Determine the largest size that won't get fragmented.
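For the "size that won't get fragmented" question, a rough starting point is the path MTU minus the IP and transport headers. The sketch below is illustrative only: the function name and the `encap_overhead` parameter are hypothetical, it assumes no IP options or extension headers, and the real MTU on the node (including any overlay overhead) must be measured rather than assumed.

```python
# Header sizes in bytes, assuming no IPv4 options / IPv6 extension headers.
IP_HEADER = {4: 20, 6: 40}
L4_HEADER = {"icmp": 8, "udp": 8}


def max_unfragmented_payload(mtu: int, ip_version: int = 4,
                             l4: str = "icmp", encap_overhead: int = 0) -> int:
    """Largest L4 payload that fits in one packet for the given MTU.

    encap_overhead is a hypothetical knob for any tunnel overhead that
    is not already reflected in the MTU you pass in.
    """
    payload = mtu - encap_overhead - IP_HEADER[ip_version] - L4_HEADER[l4]
    if payload <= 0:
        raise ValueError("MTU too small for the selected headers")
    return payload


if __name__ == "__main__":
    for mtu in (1500, 9000):
        print(mtu, max_unfragmented_payload(mtu))
```

The resulting value can be checked empirically by sending probes of that payload size with the don't-fragment bit set, for example with `ping -M do -s <size>` on Linux, and confirming that one byte more is rejected.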

Please document your tests to get this metric to increment.
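To help document whether the metric moved during a test run, one option is to capture the metrics endpoint output before and after the run and compare the counter values. The sketch below assumes the metric is exposed in the Prometheus text format and simply sums all series with that name; the helper names are hypothetical and the sample strings are made up for illustration, not real output.

```python
import re

METRIC = "ovs_vswitchd_dp_flows_lookup_lost"


def read_counter(exposition_text: str, name: str = METRIC) -> float:
    """Sum every sample of `name` found in Prometheus text-format output."""
    # Matches `name value` and `name{labels} value`; skips # HELP / # TYPE lines.
    pattern = re.compile(r"^" + re.escape(name) + r"(?:\{[^}]*\})?\s+(\S+)")
    total, found = 0.0, False
    for line in exposition_text.splitlines():
        match = pattern.match(line)
        if match:
            total += float(match.group(1))
            found = True
    if not found:
        raise KeyError(f"metric {name} not present in scrape")
    return total


def increment(before: str, after: str, name: str = METRIC) -> float:
    """Difference in the counter between two scrapes."""
    return read_counter(after, name) - read_counter(before, name)


if __name__ == "__main__":
    # Invented sample scrapes, for illustration only.
    before = "# TYPE ovs_vswitchd_dp_flows_lookup_lost counter\n" \
             "ovs_vswitchd_dp_flows_lookup_lost 10\n"
    after = "ovs_vswitchd_dp_flows_lookup_lost 17\n"
    print("lost lookups during test:", increment(before, after))
```

Recording the two raw scrapes alongside the test parameters (instance type, workload, frame size) keeps the result reproducible.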

If you cannot get it to increment, set the ovs-vswitchd setting other_config:flow-limit to 0 and retry.

Understanding this metric (and a follow-on alert) will help highlight when customers' worker nodes are overloaded and networking is degraded, impacting the user experience.
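As a sketch of the retry step, `other_config:flow-limit` lives in the `Open_vSwitch` table and can be changed with `ovs-vsctl`. The helper below only builds the command lines and does not run them; the function name is hypothetical, and it assumes the commands are executed where the node's `ovs-vsctl` can reach the local database.

```python
import shlex


def flow_limit_commands(limit: int = 0) -> dict:
    """Return ovs-vsctl argument lists to set and later remove flow-limit."""
    return {
        # Apply the limit requested for the test.
        "set": ["ovs-vsctl", "set", "Open_vSwitch", ".",
                f"other_config:flow-limit={limit}"],
        # Drop the key again so ovs-vswitchd returns to its default.
        "reset": ["ovs-vsctl", "remove", "Open_vSwitch", ".",
                  "other_config", "flow-limit"],
    }


if __name__ == "__main__":
    for step, argv in flow_limit_commands(0).items():
        print(f"{step}: {shlex.join(argv)}")
```

Removing the key after the test avoids leaving the node in a non-default configuration.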

is cloned by

SDN-3454 Understand if we can detect ovn-controller cpu starvation using existing metric

To Do

Assignee: Unassigned

Reporter: Martin Kennelly

Votes: 0

Watchers: 1

Created: 2022/08/25 10:41 AM

Updated: 2022/12/08 12:31 PM
