Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-6047

Deadlock when a prepare command is retried

This issue belongs to an archived project. You can view it, but you can't modify it. Learn more

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Major Major
    • 8.2.0.CR1, 8.1.4.Final
    • 8.1.0.Final
    • Core
    • None

      Looks like the ISPN-5623 fix went too far, and now I found a test failure with the opposite behaviour:

      1. Remote prepare for txA acquires lock K
      2. Remote prepare for txB blocks waiting for lock K
      3. The topology changes, and the txA prepare is retried
      4. The txA prepare times out, because it waits for pending transaction txB to finish.

      So we have to make txA somehow know that it already has the lock after it received an UnsureResponse for the prepare command, and skip waiting for pending transactions.

      I found the problem in a random failure of DistributedFourNodesMapReduceTest on a local branch, but I'm not sure if my local changes (making SyncCHF the default CH factory) made it more likely.

              pruivo@redhat.com Pedro Ruivo
              dberinde@redhat.com Dan Berindei (Inactive)
              Archiver:
              rhn-support-adongare Amol Dongare

                Created:
                Updated:
                Resolved:
                Archived: