Uploaded image for project: 'JBoss Enterprise Application Platform'
  1. JBoss Enterprise Application Platform
  2. JBEAP-21048

Clustering: java.lang.ref.Reference.waitForReferencePendingList in failover tests

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Cannot Reproduce
    • Icon: Minor Minor
    • None
    • 7.4.0.Beta
    • Clustering
    • None
    • False
    • False
    • Undefined

      The error is present in EAP Clustering tests where HTTP session data is replicated;

      We spotted the error in 2 kind of tests:

      • tests with replicated cache where fail-over is introduced by shutting down nodes
      • tests with replicated cache where fail-over is introduced by shutting down nodes and replication level is granular
      • tests with replicated cache and shared-sessions where fail-over is introduced by jvmkill
      • tests with replicated SFSB

      In the first kind of tests the nodes are configured as follows:

      embed-server --server-config=standalone-ha.xml
      /subsystem=jgroups/channel=ee:write-attribute(name=stack,value=tcp)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl:add()
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/component=locking:write-attribute(name=isolation, value=REPEATABLE_READ)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/component=transaction:write-attribute(name=mode, value=BATCH)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/store=file:add()
      /subsystem=infinispan/cache-container=web:write-attribute(name=default-cache, value=testRepl)
      

      In the second kind of tests the nodes are configured as follows:

      embed-server --server-config=standalone-ha.xml
      /subsystem=jgroups/channel=ee:write-attribute(name=stack,value=tcp)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl:add()
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/component=locking:write-attribute(name=isolation, value=REPEATABLE_READ)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/component=transaction:write-attribute(name=mode, value=BATCH)
      /subsystem=infinispan/cache-container=web/replicated-cache=testRepl/store=file:add()
      /subsystem=infinispan/cache-container=web:write-attribute(name=default-cache, value=testRepl)
      

      In the fourth kind of tests the nodes are configured as follows:

      embed-server --server-config=standalone-ha.xml
      /subsystem=jgroups/channel=ee:write-attribute(name=stack,value=tcp)
      /subsystem=infinispan/cache-container=web/replicated-cache=testReplWeb:add()
      /subsystem=infinispan/cache-container=web/replicated-cache=testReplWeb/component=locking:write-attribute(name=isolation, value=REPEATABLE_READ)
      /subsystem=infinispan/cache-container=web/replicated-cache=testReplWeb/component=transaction:write-attribute(name=mode, value=BATCH)
      /subsystem=infinispan/cache-container=web/replicated-cache=testReplWeb/store=file:add()
      /subsystem=infinispan/cache-container=web:write-attribute(name=default-cache, value=testReplWeb)
      /subsystem=infinispan/cache-container=ejb/replicated-cache=testReplEjb:add()
      /subsystem=infinispan/cache-container=ejb/replicated-cache=testReplEjb/component=locking:write-attribute(name=isolation, value=REPEATABLE_READ)
      /subsystem=infinispan/cache-container=ejb/replicated-cache=testReplEjb/component=transaction:write-attribute(name=mode, value=BATCH)
      /subsystem=infinispan/cache-container=ejb/replicated-cache=testReplEjb/store=file:add()
      /subsystem=infinispan/cache-container=ejb:write-attribute(name=default-cache, value=testReplEjb)
      

      The error happens after the nodes are restarted:

      2021-02-15 11:41:19,763 FATAL [org.jgroups.protocols.TCP] (Connection.Receiver [10.0.144.84:7600 - 10.0.146.24:51588]-9,ejb,wildfly1) wildfly1: thread pool is full (max=100, active=200); thread dump (dumped once, until thread_dump is reset):
      Reference Handler:
          at java.lang.ref.Reference.waitForReferencePendingList(Reference.java:-2)
          at java.lang.ref.Reference.processPendingReferences(Reference.java:241)
          at java.lang.ref.Reference$ReferenceHandler.run(Reference.java:213)
      
      
      Finalizer:
          at java.lang.Object.wait(Object.java:-2)
          at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:155)
          at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:176)
          at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:170)
      
      
      Signal Dispatcher:
      
      
      Common-Cleaner:
          at java.lang.Object.wait(Object.java:-2)
          at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:155)
          at jdk.internal.ref.CleanerImpl.run(CleanerImpl.java:148)
          at java.lang.Thread.run(Thread.java:834)
          at jdk.internal.misc.InnocuousThread.run(InnocuousThread.java:134)
      
      
      Reference Reaper:
          at java.lang.Object.wait(Object.java:-2)
          at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:155)
          at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:176)
          at org.jboss.modules.ref.References$ReaperThread.run(References.java:64)
      
      
      MSC service thread 1-1:
          at jdk.internal.misc.Unsafe.park(Unsafe.java:-2)
          at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:234)
          at org.jboss.threads.EnhancedQueueExecutor$ThreadBody.run(EnhancedQueueExecutor.java:1421)
          at java.lang.Thread.run(Thread.java:834)
      
      
      MSC service thread 1-2:
          at jdk.internal.misc.Unsafe.park(Unsafe.java:-2)
          at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:234)
          at org.jboss.threads.EnhancedQueueExecutor$ThreadBody.run(EnhancedQueueExecutor.java:1421)
          at java.lang.Thread.run(Thread.java:834)
      etc...
      

      The client is not impacted by the error;

      Find complete logs attached;

      Complete runs here:

              Unassigned Unassigned
              tborgato@redhat.com Tommaso Borgato
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: