Loading...

This issue belongs to an archived project. You can view it, but you can't modify it. Learn more

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 5.2.4.Final, 5.3.0.Final
Affects Version/s: 5.2.2.Final
Component/s: State Transfer
Labels:
- nbst

Bugzilla References:
https://bugzilla.redhat.com/show_bug.cgi?id=918887

This happened probably the first time, but the issue is here:

When old coordinator leaves the cluster, it sends a REBALANCE_START as a goodbye. This will trigger rebalance process on some of the nodes. As we do sync GET_TRANSACTIONS, processing this command may take a while.

However, new coordinator will send CH_UPDATE, which will change the current topologyId to a higher id. This command is processed in LocalTopologyManagerImpl synchronized on cacheStatus, but rebalance command has already left its synchronized block when it executes handler.rebalance.

Then, as the old REBALANCE_START tries to call notifyTransactionDataReceived in its finally block, it finds out that the topologyId has increased and throws an exception. But the rebalance is left in inconsistent state (activeTopologyUpdates are non-zero, potentionally waitForState true, DataRehash listener notification not called...).

is incorporated by

ISPN-2825 ClusterTopologyManagerImpl should not hold a lock while invoking an RPC

Closed

Assignee:: Dan Berindei (Inactive)

Reporter:: Radim Vansa (Inactive)

Archiver:: Amol Dongare

Created:: 2013/02/28 6:46 AM

Updated:: 2024/07/15 9:10 AM

Resolved:: 2013/03/06 11:40 AM

Archived:: 2024/11/28 6:21 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty