Description
Due to a timing issue, VERIFY_SUSPECT can kick a good member from the cluster soon after joining.
A live node currently in the cluster stops responding temporarily (garbage collection, NIC down, etc).
FD sends up a suspect message to VERIFY_SUSPECT.
VERIFY_SUSPECT times out and sends up the suspect message.
Before the node is kicked from the cluster, FD sends up another suspect message to VERIFY_SUSPECT.
Node is kicked from the cluster.
Node resumes, is shunned, and rejoins.
VERIFY_SUSPECT times out and sends up the suspect message.
Node is kicked from the cluster again.
Attachments
Issue Links
- blocks
-
JBPAPP-7456 JGroups VERIFY_SUSPECT can kick a live node soon after it re-joins
- Closed