Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-1769

Rehashing hang ups leading to nodes starting as sole member in cluster

    XMLWordPrintable

Details

    • Bug
    • Resolution: Done
    • Blocker
    • 5.1.0.FINAL
    • 5.1.0.CR4
    • Core
    • None

    Description

      It seems we still have rehashing issues starting nodes, see email below:

      Hey Dan,

      I'm trying to run the old Infinispan Lab where I start 4 nodes, each of which starts a DIST_SYNC cache (Infinispan 5.1.0.CR4) with these configurations:

      new ConfigurationBuilder()
      .clustering()
      .cacheMode(CacheMode.DIST_SYNC)
      .l1().disable()
      .jmxStatistics()
      .build();

      new DefaultCacheManager(
      GlobalConfigurationBuilder.defaultClusteredBuilder()
      .transport()
      .addProperty("configurationFile", "jgroups.xml")
      .build()
      );

      And I got a hang in one of the nodes that ended up starting on its own. This is very similar to the issues we had back in November.

      I don't have TRACE logs yet but I have a thread dump of all the nodes which you can find attached.

      It's run in AS7 domain model so the output of all processes is mixed up. The node that doesn't start in time is 'Server:server-four', so you can grep by that.

      There's barely 7 entries in memory and should not have up like this.

      I'm gonna try to get some TRACE logs.

      Attachments

        Activity

          People

            dberinde@redhat.com Dan Berindei (Inactive)
            rh-ee-galder Galder Zamarreño
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: