Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: 12.1.2.Final, 13.0.0.Final
Affects Version/s: 12.1.1.Final
Component/s: Core
Labels:
None

Release Note Text:
Undefined
Git Pull Request:
https://github.com/infinispan/infinispan/pull/9251, https://github.com/infinispan/infinispan/pull/9263

In the operator we use zero-capacity nodes to create and restore backups.

Source cluster created
Backup pod started with zero-capacity=true, backup created, node leaves
Source cluster shutdown
Target cluster created (pod test-backup-restore-data-grid-target-0)
Restore pod started with zero-capacity=true, restore restored, node leaves

In our testsuite we're frequently seeing step 5 fail, as the Restore pod is timing out when trying to join the cluster:

 WARN  (timeout-thread--p4-t1) [org.infinispan.CLUSTER] ISPN000071: Caught exception when handling command TopologyJoinCommand{cacheName='someCache', origin=restore-25099, joinInfo=CacheJoinInfo{consistentHashFactory=org.infinispan.distribution.ch.impl.SyncConsistentHashFactory@ffffd8e9, numSegments=256, numOwners=2, timeout=240000, cacheMode=DIST_SYNC, persistentUUID=ccc7e49d-fec1-4131-a783-0435c303edfc, persistentStateChecksum=Optional.empty}, viewId=1} org.infinispan.util.concurrent.TimeoutException

We have also seen issues with the zero-pods at step 2, but I don't have logs for that yet.

Logs are attached.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

backup
445 kB
2021/04/15 9:04 AM
restore
485 kB
2021/04/15 9:04 AM
test-backup-restore-data-grid-target-0
463 kB
2021/04/15 9:04 AM

Assignee:: Ryan Emerson

Reporter:: Ryan Emerson

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2021/04/15 9:03 AM

Updated:: 2024/07/12 5:00 PM

Resolved:: 2021/04/26 11:09 AM

Details

Description

Attachments

Attachments

Easy Agile Planning Poker

Activity

People

Dates