Uploaded image for project: 'Red Hat Data Grid'
  1. Red Hat Data Grid
  2. JDG-3991

OpenShift: Server killed by OOM restarts forever after

    XMLWordPrintable

Details

    • Impediment
    • Hide

      Shutdown whole cluster and bring it back.

      Show
      Shutdown whole cluster and bring it back.
    • Hide
      • Deploy a default cluster of two nodes
      • Start filling default cache till eviction starts removing incoming entries
      • Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
      Show
      Deploy a default cluster of two nodes Start filling default cache till eviction starts removing incoming entries Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
    • DataGrid Sprint #56, DataGrid Sprint #61, DataGrid Sprint #62

    Description

      When user fully fills default cache, even though eviction is in place, server gets eventually killed while under the load. Server killed that way isn't able to start properly after that due to:

      FATAL (main) [org.infinispan.SERVER] ISPN080028: Red Hat Data Grid Server failed to start java.util.concurrent.ExecutionException: org.infinispan.manager.EmbeddedCacheManagerStartupException: org.infinispan.commons.CacheException: Initial state transfer timed out for cache org.infinispan.LOCKS on example-infinispan-0-61960
      

      Full failing server log in the attachment.

      Attachments

        Issue Links

          Activity

            People

              remerson@redhat.com Ryan Emerson
              pdrobek@redhat.com Pavel Drobek
              Votes:
              0 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: