- 3 nodes, server mode with Partition handling enabled
- 2 nodes are killed and bring back online
- the nodes are unable to merge and the cluster remains in degraded mode.
I suspect that the FORK channel/protocol is the culprit since the heartbeat command is never handled in the joiner node, but the coordinator receives a CacheNotFoundResponse quickly (i.e. without timeout). The request is received and "delivered" but never reaches Infinispan.
When starting node 1 (logs from coordinator):
When I started node 2:
It is always reproducible. The configuration is