Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-4841

TopologyAwareConsistentHashFactory is slow for large cluster

    XMLWordPrintable

Details

    • Enhancement
    • Resolution: Unresolved
    • Major
    • None
    • 7.0.0.CR1
    • Core
    • None

    Description

      A user observed 100% CPU usage for a long time on coordinator node when booting 500 nodes with 500 caches defined.

      It looks like the TopologyAwareConsistentHashFactory performs O(n^2), it has double loop for all Machines. It takes 50 sec to compute rebalance with 1 cache 500 nodes. This calculation is performed on every cache, so it eats 25000 sec CPU times with 500 nodes 500 caches.

      The hprof shows 90% of the time is consumed in the TopologyInfo.computeMaxSegmentsForMachine().

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rhn-support-tkimura Takayoshi Kimura
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: