Uploaded image for project: 'Red Hat Data Grid'
  1. Red Hat Data Grid
  2. JDG-3991

OpenShift: Server killed by OOM restarts forever after

XMLWordPrintable

    • Impediment
    • Hide

      Shutdown whole cluster and bring it back.

      Show
      Shutdown whole cluster and bring it back.
    • Hide
      • Deploy a default cluster of two nodes
      • Start filling default cache till eviction starts removing incoming entries
      • Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
      Show
      Deploy a default cluster of two nodes Start filling default cache till eviction starts removing incoming entries Keep sending entries, one of the servers gets eventually killed while memory usage is close to a maximum (512MB in this case)
    • DataGrid Sprint #56, DataGrid Sprint #61, DataGrid Sprint #62

      When user fully fills default cache, even though eviction is in place, server gets eventually killed while under the load. Server killed that way isn't able to start properly after that due to:

      FATAL (main) [org.infinispan.SERVER] ISPN080028: Red Hat Data Grid Server failed to start java.util.concurrent.ExecutionException: org.infinispan.manager.EmbeddedCacheManagerStartupException: org.infinispan.commons.CacheException: Initial state transfer timed out for cache org.infinispan.LOCKS on example-infinispan-0-61960
      

      Full failing server log in the attachment.

            remerson@redhat.com Ryan Emerson
            pdrobek@redhat.com Pavel Drobek
            Votes:
            0 Vote for this issue
            Watchers:
            12 Start watching this issue

              Created:
              Updated:
              Resolved: