Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-2495

Replication on startup not reliable

    XMLWordPrintable

Details

    • Bug
    • Resolution: Done
    • Blocker
    • 5.2.0.Beta6
    • 5.2.0.Beta3
    • State Transfer
    • None

    Description

      Situation: REPL cache with Cache Loader and ~500k entries and preload = true, and shared cache store.

      Problem: 1st node starts and the startup (until cache loaded all entries and so on) takes lets say 2min. When starting the 2nd node after e.g. 30s, then state is requested, but contains no results. cache loader is also not used, since its not the 1st node. Log from 2nd node is attached.
      So I've REPL cache over two (or more ) nodes, where only the 1st node contains the entries.

      When starting all other nodes later, then state transfer works and the data gets replicated.

      Maybe startup of all other nodes should be blocked until warm start of first node has finished?

      Attachments

        1. 2495.patch
          14 kB
        2. 2ND.log
          17 kB

        Issue Links

          Activity

            People

              dberinde@redhat.com Dan Berindei (Inactive)
              tfromm_jira Thomas Fromm (Inactive)
              Votes:
              2 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: