Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 2.3
Affects Version/s: 2.2.8, 2.2.9, 2.2.9.1, 2.2.9.2
Labels:
None

Estimated Difficulty:
High
Workaround:

Workaround Exists
Workaround Description:

Wait until the cluster has fully started before sending messages, e.g. register for view changes, increment a counter, and start sending messages once a minimum number has been exceeded.

Crude: a short timeout before sending messages; works fine for smaller clusters (-8 nodes).
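
The first workaround could be sketched roughly as follows. This is a minimal illustration, not JGroups code: `StartupGate`, `onViewChange` and `send` are hypothetical names, the sketch gates on the size of the latest view rather than on a count of view changes, and the `sent` list merely stands in for the real multicast. The application would call `onViewChange` from whatever view-change callback it has registered.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: holds back outgoing messages until the view is large enough.
class StartupGate {
    private final int minMembers;
    private final List<String> pending = new ArrayList<>(); // held back until the gate opens
    private final List<String> sent = new ArrayList<>();    // stand-in for the real multicast
    private boolean open;

    StartupGate(int minMembers) {
        this.minMembers = minMembers;
    }

    // Call with the size of each newly installed view.
    synchronized void onViewChange(int viewSize) {
        if (!open && viewSize >= minMembers) {
            open = true;
            sent.addAll(pending); // release buffered messages in their original order
            pending.clear();
        }
    }

    // Sends immediately once the gate is open, otherwise buffers the message.
    synchronized void send(String msg) {
        if (open) {
            sent.add(msg);
        } else {
            pending.add(msg);
        }
    }

    synchronized List<String> sent() {
        return new ArrayList<>(sent);
    }

    synchronized int pendingCount() {
        return pending.size();
    }
}
```

Buffering instead of dropping means that updates issued early are delayed rather than lost; whether that is acceptable depends on the application.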


When we have group {A,B,C} and D joins, then the coordinator (A) runs the following algorithm for handling JOIN(D):
#1 Compute new view V2={A,B,C,D}
#2 Send unicast response with JOIN_RSP(V2) to D (D installs V2)
#3 Multicast V2 to {A,B,C}, all install V2

If D multicasts a message to the cluster before the existing members install V2, then those members who haven't installed V2 when the message from D is received will discard it, because D is not in their view (still V1). If the message from D modified some state, e.g. a put(key,val) on a replicated hashmap, then the hashmaps will end up with inconsistent states.
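
The race can be reproduced with a toy model. The sketch below is an illustration only and does not use JGroups: `Member`, `installView`, `receivePut` and `JoinRaceDemo` are hypothetical names, and message delivery is simulated by direct method calls in a fixed order (D and A have installed V2, B and C have not, when D's put arrives).

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Toy model of a cluster member (hypothetical, not JGroups code).
class Member {
    final String name;
    final Set<String> view = new HashSet<>();
    final Map<String, String> state = new HashMap<>(); // the replicated hashmap

    Member(String name, String... initialView) {
        this.name = name;
        view.addAll(Arrays.asList(initialView));
    }

    void installView(Set<String> newView) {
        view.clear();
        view.addAll(newView);
    }

    // Applies a multicast put only if the sender is in the current view.
    void receivePut(String sender, String key, String val) {
        if (view.contains(sender)) {
            state.put(key, val);
        } // otherwise the message is discarded
    }
}

class JoinRaceDemo {
    // Runs the original order (#2 before #3) and returns the members A, B, C, D.
    static Member[] run() {
        Member a = new Member("A", "A", "B", "C");
        Member b = new Member("B", "A", "B", "C");
        Member c = new Member("C", "A", "B", "C");
        Member d = new Member("D");
        Member[] all = {a, b, c, d};
        Set<String> v2 = new HashSet<>(Arrays.asList("A", "B", "C", "D"));

        d.installView(v2); // #2: JOIN_RSP(V2) reaches D
        a.installView(v2); // #3: the V2 multicast has reached A, but not yet B and C

        // D multicasts put(key,val) while B and C are still in V1.
        for (Member m : all) {
            m.receivePut("D", "key", "val");
        }

        b.installView(v2); // V2 arrives too late for the put
        c.installView(v2);
        return all;
    }

    public static void main(String[] args) {
        for (Member m : run()) {
            System.out.println(m.name + " view=" + m.view + " state=" + m.state);
        }
    }
}
```

In this run all four members end up with the same view, yet only A and D hold the entry, which is the inconsistency described above.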

SOLUTION:
Swap #3 and #2: multicast V2 first (and wait for all view_acks), then send the JOIN_RSP to D. This way, the remaining race is that any one of {A,B,C} could multicast a message to the cluster (including D) before D has installed V2, so D would discard the message. However, if there is state involved, D will fetch the state from A anyway, overwriting whatever that spurious message changed in the state.
The chance of this happening is relatively small anyway. However, the real solution will be FLUSH (see JGroups/doc/design/FLUSH.txt, towards the end of the document).
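
The swapped order, including the remaining race and the state transfer that covers it, might look like this in a toy simulation. Again this is only a sketch under assumptions: `Node`, `SwappedJoinDemo` and their methods are hypothetical, the boolean returned by `installView` stands in for a view_ack, and the state transfer from A is modeled as a plain map copy.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Toy cluster node (hypothetical, not JGroups code).
class Node {
    final String name;
    Set<String> view = new HashSet<>();
    Map<String, String> state = new HashMap<>();

    Node(String name) {
        this.name = name;
    }

    // Installs the view; the return value models the view_ack sent to the coordinator.
    boolean installView(Set<String> v) {
        view = new HashSet<>(v);
        return true;
    }

    // Applies a multicast put only if the sender is in the current view.
    void receivePut(String sender, String key, String val) {
        if (view.contains(sender)) {
            state.put(key, val);
        }
    }
}

class SwappedJoinDemo {
    // Runs the swapped order (#3 before #2) and returns the nodes A, B, C, D.
    static Node[] run() {
        Set<String> v1 = new HashSet<>(Arrays.asList("A", "B", "C"));
        Set<String> v2 = new HashSet<>(Arrays.asList("A", "B", "C", "D"));
        Node a = new Node("A"), b = new Node("B"), c = new Node("C"), d = new Node("D");
        Node[] old = {a, b, c};
        for (Node n : old) {
            n.installView(v1);
        }

        // #3 first: multicast V2 to {A,B,C} and count the view_acks.
        int acks = 0;
        for (Node n : old) {
            if (n.installView(v2)) {
                acks++;
            }
        }

        // Remaining race: B multicasts before D has installed V2, so D discards it.
        for (Node n : new Node[] {a, b, c, d}) {
            n.receivePut("B", "key", "val");
        }

        // #2 second: only after all acks does D get JOIN_RSP(V2) ...
        if (acks == old.length) {
            d.installView(v2);
            // ... and then fetches the state from A, which includes B's put.
            d.state = new HashMap<>(a.state);
        }
        return new Node[] {a, b, c, d};
    }

    public static void main(String[] args) {
        for (Node n : run()) {
            System.out.println(n.name + " view=" + n.view + " state=" + n.state);
        }
    }
}
```

In this model D misses the put itself but ends up consistent because of the state transfer; an application that does not transfer state would still lose the message.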


ConcurrentStartupTest.java
8 kB
2006/05/22 1:39 AM

Assignee: Bela Ban

Reporter: Bela Ban

Votes: 0

Watchers: 0

Created: 2006/05/19 3:38 PM

Updated: 2006/05/22 3:13 AM

Resolved: 2006/05/22 3:13 AM
