Loading...

XML

Word

Printable

Type: Epic
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: None
Component/s: openstack-neutron
Labels:
None

Epic Name:
Adopt active heartbeating timeout mechanism for neutron RPC calls using oslo.messaging
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Dev Approval:
Committed
Docs Approval:
No Docs Impact
Epic Status:
To Do
PM Approval:
Committed
QE Approval:
Proposed
Hierarchy Progress Bar:

100% To Do, 0% In Progress, 0% Done
Intelligence Requested:
Market:

Workstream:

Networking; Neutron

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

This Epic is to track upstream effort to adopt a more robust RPC timeout monitoring mechanism implemented in oslo.messaging when using call_monitor_timeout option of the library to create RPC clients.

Currently, Neutron RPC client implements its own back-off mechanism to handle timeout, which first fails long calls, then repeats them with a higher timeout, and proceeds to do so (up to a limit). The suggestion here is to instead allow oslo.messaging to run active heartbeating / probing of the RPC channel and NOT fail long operations when they take a longer time BUT are not due to a death of neutron call handler.

The implementation promises improvement in loaded cluster behavior when communicating to AMQP agents (neutron-dhcp, neutron-sriov.) Specifically, longer operations in a cluster under load should fail less frequently.

links to

Upstream Neutron RFE

Assignee:: Ihar Hrachyshka

Reporter:: Ihar Hrachyshka

Team:: rhos-dfg-networking-squad-neutron

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2024/01/16 9:50 PM

Updated:: 2024/05/08 4:49 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates

PagerDuty