-
Bug
-
Resolution: Done
-
Blocker
-
EAP 5.0.0.BETA
-
None
-
Brno's jawas
Server have problem reconnecting after some have been shut down and started again.
Test scenario:
1/ all nodes start up
2/ clients starts sending requests
<->
3/ node x shuts down
4/ node x starts up
Problem occurs when (in this case 3rd) node tries to join the cluster.
Excerpt from logging:
[JBoss] 14:58:22,671 TRACE [org.jgroups.protocols.pbcast.GMS] processing [SUSPECT(10.34.32.156:47225)]
[JBoss] 14:58:22,671 TRACE [org.jgroups.protocols.pbcast.GMS] processing [SUSPECT(10.34.32.156:47225)]
[JBoss] 14:58:22,935 TRACE [org.jgroups.protocols.pbcast.GMS] processing [SUSPECT(10.34.32.156:47225)]
[JBoss] 14:58:22,935 TRACE [org.jgroups.protocols.pbcast.GMS] processing [SUSPECT(10.34.32.156:47225)]
[JBoss] 14:58:25,271 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:25,271 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:25,272 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=false], [own_addr=10.34.32.154:59773, coord_addr=10.34.32.154:59773, is_server=false], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:25,272 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=false], [own_addr=10.34.32.154:59773, coord_addr=10.34.32.154:59773, is_server=false], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:25,273 DEBUG [org.jgroups.protocols.pbcast.GMS] election results:
[JBoss] 14:58:25,273 DEBUG [org.jgroups.protocols.pbcast.GMS] election results:
{10.34.32.156:47225=1}[JBoss] 14:58:25,273 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:25,273 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:25,339 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.153:33086 timed out (after 3000 ms), retrying
[JBoss] 14:58:25,339 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.153:33086 timed out (after 3000 ms), retrying
[JBoss] 14:58:25,340 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=true], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:25,340 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=true], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:25,340 WARN [org.jgroups.protocols.pbcast.GMS] there was more than 1 candidate for coordinator:
[JBoss] 14:58:25,340 WARN [org.jgroups.protocols.pbcast.GMS] there was more than 1 candidate for coordinator:
{10.34.32.156:47225=1, 10.34.32.153:33086=1}[JBoss] 14:58:25,340 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:25,340 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:28,275 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:28,275 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=false], [own_addr=10.34.32.154:59773, coord_addr=10.34.32.154:59773, is_server=false], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=false], [own_addr=10.34.32.154:59773, coord_addr=10.34.32.154:59773, is_server=false], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] election results:
[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] election results:
{10.34.32.156:47225=1}[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:28,280 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:28,343 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:28,343 WARN [org.jgroups.protocols.pbcast.GMS] join(10.34.32.155:50281) sent to 10.34.32.156:47225 timed out (after 3000 ms), retrying
[JBoss] 14:58:28,344 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=true], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:28,344 DEBUG [org.jgroups.protocols.pbcast.GMS] initial_mbrs are [[own_addr=10.34.32.153:33086, coord_addr=10.34.32.153:33086, is_server=true], [own_addr=10.34.32.156:47225, coord_addr=10.34.32.156:47225, is_server=true]]
[JBoss] 14:58:28,344 WARN [org.jgroups.protocols.pbcast.GMS] there was more than 1 candidate for coordinator:
[JBoss] 14:58:28,344 WARN [org.jgroups.protocols.pbcast.GMS] there was more than 1 candidate for coordinator:
{10.34.32.156:47225=1, 10.34.32.153:33086=1}[JBoss] 14:58:28,344 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
[JBoss] 14:58:28,344 DEBUG [org.jgroups.protocols.pbcast.GMS] sending handleJoin(10.34.32.155:50281) to 10.34.32.156:47225
- blocks
-
JBPAPP-1260 EAP5 failover testing issues - Tracker JIRA
- Closed
- relates to
-
JBPAPP-2337 Starvation of the OOB thread pool causing lock timeouts and lock upgrade failures
- Resolved