JBEAP-25703 (JBoss Enterprise Application Platform)

[CLUSTERING] ISPN000208: No live owners found for segments

      Scenario: we have a 4-node cluster in which each node is configured as follows:

      embed-server --server-config=standalone-ha.xml
      if (outcome != success) of /subsystem=jgroups:read-attribute(name=default-stack)
          /subsystem=jgroups/channel=ee:write-attribute(name=stack,value=tcp)
      else
          /subsystem=jgroups:write-attribute(name=default-stack,value=tcp)
          /subsystem=jgroups/channel=ee:write-attribute(name=stack,value=tcp)
      end-if
      

      During the tests, each node is failed by killing the EAP JVM and is then restarted.
      At the end of the tests, all nodes are stopped one after the other, starting with NODE1.

      The error appears on NODE4 when NODE1 and NODE2 are already stopped and NODE3 is stopping. The node statuses and logs at the time of the error are shown below (complete logs and configurations are attached in wildfly-ISPN000208.zip):

      NODE1 is stopped:

      2023-09-25 13:01:23,402 INFO  [org.jboss.as] (MSC service thread 1-2) WFLYSRV0050: JBoss EAP 8.0.0.GA (WildFly Core 21.0.3.Final-redhat-00001) stopped in 805ms
      

      NODE2 is stopped:

      2023-09-25 13:01:40,046 INFO  [org.jboss.as] (MSC service thread 1-1) WFLYSRV0050: JBoss EAP 8.0.0.GA (WildFly Core 21.0.3.Final-redhat-00001) stopped in 563ms
      

      NODE3 is stopping:

      2023-09-25 13:01:56,846 INFO  [org.jboss.as.clustering.infinispan] (ServerService Thread Pool -- 96) WFLYCLINF0003: Stopped default-server cache from web container
      2023-09-25 13:01:56,854 INFO  [org.jboss.as.clustering.infinispan] (ServerService Thread Pool -- 97) WFLYCLINF0003: Stopped clusterbench-ee10.ear.clusterbench-ee10-web-passivating.war cache from web container
      

      NODE4 logs ERROR:

      2023-09-25 13:01:23,419 INFO  [org.infinispan.CLUSTER] (thread-20,ejb,wildfly4) ISPN100001: Node wildfly1 left the cluster
      ...
      2023-09-25 13:01:40,043 INFO  [org.infinispan.CLUSTER] (thread-29,ejb,wildfly4) ISPN100001: Node wildfly2 left the cluster
      ...
      2023-09-25 13:01:40,141 INFO  [org.infinispan.LIFECYCLE] (non-blocking-thread--p7-t1) [Context=http-remoting-connector] ISPN100010: Finished rebalance with members [wildfly3, wildfly4], topology id 64
      2023-09-25 13:01:56,852 ERROR [org.infinispan.statetransfer.StateConsumerImpl] (non-blocking-thread--p11-t1) ISPN000208: No live owners found for segments {72} of cache clusterbench-ee10.ear.clusterbench-ee10-web.war. Excluded owners: []
      2023-09-25 13:01:56,855 ERROR [org.infinispan.statetransfer.StateConsumerImpl] (non-blocking-thread--p11-t1) ISPN000208: No live owners found for segments {72} of cache clusterbench-ee10.ear.clusterbench-ee10-web.war. Excluded owners: []
      2023-09-25 13:01:56,951 INFO  [org.infinispan.CLUSTER] (thread-47,ejb,wildfly4) ISPN100001: Node wildfly3 left the cluster
      

      The client failure rate is not affected, since the clients are already stopped when the error occurs.
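
      To check the ordering of events in the NODE4 log against the full logs in the attachment, something like the following could be used. It is only a minimal sketch, not part of the original report: the function names `parse_events` and `errors_before_leave` are hypothetical, and the regular expressions assume the exact log line layout shown in the excerpts above.

```python
import re
from datetime import datetime

# Assumed layout: "<timestamp> <LEVEL> [<category>] (<thread>) <message>",
# as seen in the log excerpts of this report.
LINE_RE = re.compile(
    r"^\s*(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+"
    r"(?P<level>[A-Z]+)\s+\[(?P<category>[^\]]+)\]\s+\((?P<thread>[^)]*)\)\s+(?P<msg>.*)$"
)
LEFT_RE = re.compile(r"ISPN100001: Node (?P<node>\S+) left the cluster")
NO_OWNERS_RE = re.compile(
    r"ISPN000208: No live owners found for segments (?P<segments>\{[^}]*\}) "
    r"of cache (?P<cache>\S+)\. Excluded owners: (?P<excluded>\[.*\])"
)


def parse_events(lines):
    """Extract 'node left' and ISPN000208 events, sorted by timestamp."""
    events = []
    for line in lines:
        m = LINE_RE.match(line)
        if m is None:
            continue  # not a log line in the assumed layout (e.g. "...")
        ts = datetime.strptime(m["ts"], "%Y-%m-%d %H:%M:%S,%f")
        left = LEFT_RE.search(m["msg"])
        no_owners = NO_OWNERS_RE.search(m["msg"])
        if left:
            events.append({"ts": ts, "kind": "left", "node": left["node"]})
        elif no_owners:
            events.append({
                "ts": ts,
                "kind": "no_live_owners",
                "cache": no_owners["cache"],
                "segments": no_owners["segments"],
            })
    return sorted(events, key=lambda e: e["ts"])


def errors_before_leave(events, node):
    """Return ISPN000208 events logged before the last time `node` left."""
    leave_times = [
        e["ts"] for e in events if e["kind"] == "left" and e["node"] == node
    ]
    if not leave_times:
        return []
    # A node may leave several times during kill/restart cycles;
    # the final shutdown is the last leave event.
    cutoff = max(leave_times)
    return [
        e for e in events
        if e["kind"] == "no_live_owners" and e["ts"] < cutoff
    ]
```

      For example, `errors_before_leave(parse_events(open("server.log")), "wildfly3")` would list the ISPN000208 errors that NODE4 logged before it saw wildfly3 leave; `server.log` here is a placeholder for the NODE4 log file from the attachment.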

              pferraro@redhat.com Paul Ferraro
              tborgato@redhat.com Tommaso Borgato