Loading...

This issue belongs to an archived project. You can view it, but you can't modify it. Learn more

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 8.2.5.Final, 9.0.0.Final
Affects Version/s: 8.2.2.Final, 9.0.0.Final
Component/s: Core
Labels:
None

Git Pull Request:
https://github.com/infinispan/infinispan/pull/4441, https://github.com/infinispan/infinispan/pull/4664

Note: This is a scenario that happens in the stress tests, with 4 nodes in dist mode, and 200+ threads per node doing only reads. I have not been able to reproduce it locally, even with a much lower OOB thread pool size and UFC.max_credits.

We don't use the NO_FC flag, so threads sending both requests and responses can block in UFC/MFC. Remote gets are executed directly on the OOB thread, so when we run out of credits for one node, the OOB pool can quickly become full with threads waiting to send a remote get response to that node.

While we can't send responses to that node, we won't send credits to it, either, as credits are only sent after the message has been processed by the application. That means OOB threads on all nodes will start blocking, trying to send remote get responses to us.

This is made a worse by our staggering of remote gets. As remote get responses block, the stagger timeout kicks in and we send even more remote gets, making it even harder for the system to recover.

UFC/MFC can send a CREDIT_REQUEST message to ask for more credits. The REPLENISH messages are handled on JGroups' internal thread pool, so they are not blocked. However, the CREDIT_REQUEST can be sent at most once every UFC.max_block_time ms, so they can't be relied on to provide enough credits. With the default settings, the throughput would be max_credits / max_block_time == 2mb / 0.5s == 4mb/s, which is really small compared to regular throughput.

is blocked by

ISPN-6849 Upgrade to JGroups 3.6.10.Final

Closed

is incorporated by

ISPN-7101 Backports for 8.2.5.Final

Closed

is related to

JGRP-2084 FlowControl: receiver should replenish credits after message delivery, not before

Resolved

Assignee:: Dan Berindei (Inactive)

Reporter:: Dan Berindei (Inactive)

Archiver:: Amol Dongare

Created:: 2016/06/27 6:25 AM

Updated:: 2024/07/15 9:25 AM

Resolved:: 2016/07/08 2:38 PM

Archived:: 2024/11/28 6:21 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty