Uploaded image for project: 'WildFly WIP'
  1. WildFly WIP
  2. WFWIP-523

clustering does not work after WF 24 -> 27 migration

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Major Major
    • None
    • None
    • Hide
      • Use server migration tool to migrate default standalone-full-ha.xml from WF 24 to 27.
      • Copy standalone to standalone2
      • Execute two server in cluster, e.g.
        • ./bin/standalone.sh -c standalone-full-ha.xml -Djboss.node.name=host1
        • ./bin/standalone.sh -c standalone-full-ha.xml -Djboss.server.base.dir=`pwd`/standalone2 -Djboss.node.name=host2 -Djboss.socket.binding.port-offset=200
      Show
      Use server migration tool to migrate default standalone-full-ha.xml from WF 24 to 27. Copy standalone to standalone2 Execute two server in cluster, e.g. ./bin/standalone.sh -c standalone-full-ha.xml -Djboss.node.name=host1 ./bin/standalone.sh -c standalone-full-ha.xml -Djboss.server.base.dir=`pwd`/standalone2 -Djboss.node.name=host2 -Djboss.socket.binding.port-offset=200

      Using the server migration tool, clustering does not work after WF 24 -> 27 migration:

      15:38:22,023 ERROR [org.jboss.msc.service.fail] (ServerService Thread Pool -- 86) MSC000001: Failed to start service org.wildfly.clustering.jgroups.channel.ee: org.jboss.msc.service.StartException in service org.wildfly.clustering.jgroups.channel.ee: java.lang.IllegalStateException: java.lang.IllegalArgumentException: failed to start server socket
      	at org.wildfly.clustering.service@27.0.0.Final-SNAPSHOT//org.wildfly.clustering.service.FunctionalService.start(FunctionalService.java:66)
      	at org.wildfly.clustering.service@27.0.0.Final-SNAPSHOT//org.wildfly.clustering.service.AsyncServiceConfigurator$AsyncService.lambda$start$0(AsyncServiceConfigurator.java:117)
      	at org.jboss.threads@2.4.0.Final//org.jboss.threads.ContextClassLoaderSavingRunnable.run(ContextClassLoaderSavingRunnable.java:35)
      	at org.jboss.threads@2.4.0.Final//org.jboss.threads.EnhancedQueueExecutor.safeRun(EnhancedQueueExecutor.java:1990)
      	at org.jboss.threads@2.4.0.Final//org.jboss.threads.EnhancedQueueExecutor$ThreadBody.doRunTask(EnhancedQueueExecutor.java:1486)
      	at org.jboss.threads@2.4.0.Final//org.jboss.threads.EnhancedQueueExecutor$ThreadBody.run(EnhancedQueueExecutor.java:1377)
      	at java.base/java.lang.Thread.run(Thread.java:829)
      	at org.jboss.threads@2.4.0.Final//org.jboss.threads.JBossThread.run(JBossThread.java:513)
      Caused by: java.lang.IllegalStateException: java.lang.IllegalArgumentException: failed to start server socket
      	at org.jboss.as.clustering.jgroups@27.0.0.Final-SNAPSHOT//org.jboss.as.clustering.jgroups.subsystem.ChannelServiceConfigurator.get(ChannelServiceConfigurator.java:116)
      	at org.jboss.as.clustering.jgroups@27.0.0.Final-SNAPSHOT//org.jboss.as.clustering.jgroups.subsystem.ChannelServiceConfigurator.get(ChannelServiceConfigurator.java:58)
      	at org.wildfly.clustering.service@27.0.0.Final-SNAPSHOT//org.wildfly.clustering.service.FunctionalService.start(FunctionalService.java:63)
      	... 7 more
      Caused by: java.lang.IllegalArgumentException: failed to start server socket
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FD_SOCK.down(FD_SOCK.java:379)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FailureDetection.down(FailureDetection.java:159)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.VERIFY_SUSPECT.down(VERIFY_SUSPECT.java:102)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.pbcast.NAKACK2.down(NAKACK2.java:619)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.UNICAST3.down(UNICAST3.java:616)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.pbcast.STABLE.down(STABLE.java:276)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.pbcast.GMS.down(GMS.java:890)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FlowControl.down(FlowControl.java:201)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FlowControl.down(FlowControl.java:201)
      	at org.jgroups@5.2.6.Final//org.jgroups.stack.Protocol.down(Protocol.java:283)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FRAG3.down(FRAG3.java:98)
      	at org.jgroups@5.2.6.Final//org.jgroups.stack.Protocol.down(Protocol.java:283)
      	at org.jgroups@5.2.6.Final//org.jgroups.stack.ProtocolStack.down(ProtocolStack.java:929)
      	at org.jgroups@5.2.6.Final//org.jgroups.JChannel.down(JChannel.java:608)
      	at org.jgroups@5.2.6.Final//org.jgroups.JChannel._connect(JChannel.java:808)
      	at org.jgroups@5.2.6.Final//org.jgroups.JChannel.connect(JChannel.java:325)
      	at org.jgroups@5.2.6.Final//org.jgroups.JChannel.connect(JChannel.java:316)
      	at org.jboss.as.clustering.jgroups@27.0.0.Final-SNAPSHOT//org.jboss.as.clustering.jgroups.subsystem.ChannelServiceConfigurator.get(ChannelServiceConfigurator.java:112)
      	... 9 more
      Caused by: java.net.BindException: No available port to bind to in range [54200 .. 54200]
      	at org.jgroups@5.2.6.Final//org.jgroups.util.Util.bind(Util.java:3970)
      	at org.jgroups@5.2.6.Final//org.jgroups.util.Util.createServerSocket(Util.java:3946)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FD_SOCK.startServerSocket(FD_SOCK.java:640)
      	at org.jgroups@5.2.6.Final//org.jgroups.protocols.FD_SOCK.down(FD_SOCK.java:376)
      	... 26 more
      
      15:38:22,128 ERROR [org.jboss.as.controller.management-operation] (Controller Boot Thread) WFLYCTL0013: Operation ("add") failed - address: ([
          ("subsystem" => "jgroups"),
          ("channel" => "ee")
      ]) - failure description: {"WFLYCTL0080: Failed services" => {"org.wildfly.clustering.jgroups.channel.ee" => "java.lang.IllegalStateException: java.lang.IllegalArgumentException: failed to start server socket
          Caused by: java.lang.IllegalStateException: java.lang.IllegalArgumentException: failed to start server socket
          Caused by: java.lang.IllegalArgumentException: failed to start server socket
          Caused by: java.net.BindException: No available port to bind to in range [54200 .. 54200]"}}
      

       
      It would deserve to review updates in cofiguration of jgroups subsystem. RED and UFC protocols were added, and versions of FD_SOCK, FD_ALL, VERIFY_SUSPECT, FRAG3 were increased. Migrated and original default WF 27 configurations do not work together:

      15:56:50,011 WARN  [org.infinispan.topology.ClusterTopologyManagerImpl] (non-blocking-thread--p7-t6) ISPN000329: Unable to read rebalancing status from coordinator host1: java.util.concurrent.CompletionException: org.infinispan.remoting.transport.jgroups.SuspectException: ISPN000400: Node host1 was suspected
      	at java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:331)
      	at java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:346)
      	at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:632)
      	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506)
      	at java.base/java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.AbstractRequest.completeExceptionally(AbstractRequest.java:75)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.SingleTargetRequest.onNewView(SingleTargetRequest.java:67)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.jgroups.JGroupsTransport.lambda$receiveClusterView$4(JGroupsTransport.java:860)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.RequestRepository.lambda$forEach$0(RequestRepository.java:59)
      	at java.base/java.util.concurrent.ConcurrentHashMap.forEach(ConcurrentHashMap.java:1603)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.RequestRepository.forEach(RequestRepository.java:59)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.jgroups.JGroupsTransport.lambda$receiveClusterView$5(JGroupsTransport.java:860)
      	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
      	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
      	at org.wildfly.clustering.context@27.0.0.Final-SNAPSHOT//org.wildfly.clustering.context.ContextReferenceExecutor.execute(ContextReferenceExecutor.java:49)
      	at org.wildfly.clustering.context@27.0.0.Final-SNAPSHOT//org.wildfly.clustering.context.ContextualExecutor$1.run(ContextualExecutor.java:78)
      	at java.base/java.lang.Thread.run(Thread.java:829)
      Caused by: org.infinispan.remoting.transport.jgroups.SuspectException: ISPN000400: Node host1 was suspected
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.ResponseCollectors.remoteNodeSuspected(ResponseCollectors.java:31)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.SingleResponseCollector.targetNotFound(SingleResponseCollector.java:31)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.SingleResponseCollector.targetNotFound(SingleResponseCollector.java:17)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.ValidSingleResponseCollector.addResponse(ValidSingleResponseCollector.java:23)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.SingleTargetRequest.addResponse(SingleTargetRequest.java:75)
      	at org.infinispan@14.0.0.Final//org.infinispan.remoting.transport.impl.SingleTargetRequest.onNewView(SingleTargetRequest.java:63)
      	... 10 more
      

              emartins@redhat.com Eduardo Martins
              okotek@redhat.com Ondrej Kotek
              Ondrej Kotek Ondrej Kotek
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: