Type: Bug
Resolution: Not a Bug
Priority: Minor
Fix Version/s: None
Affects Version/s: JBoss A-MQ 6.3
Component/s: broker, kahadb
Labels: None

GSS Priority:
Steps to Reproduce:

A reproducer is in progress. The essentials are:

Create a 3-node broker cluster, with each broker configured to host KahaDB in a separate NFS directory. Mount the directories with the recommended NFS options.

Produce a few thousand messages to the cluster (I used 9000 messages of 10 KB each).

Start some consumers (I started 30 threads) that never acknowledge messages and only call session.recover().

Let the consumers run until all messages have been moved to the DLQ.

Move the messages back to the original queue.

Repeat until the problem reproduces and the broker fails to restart (with restartAllowed=true).

Stop the brokers, remove the index files, and restart. Observe the message counts (this is easier if the consumers are stopped).
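
As a rough illustration of why the consumer loop in these steps ends with every message in the DLQ, here is a toy model in plain Java. It does not use JMS or a broker. The class name, the method name, and the redelivery limit passed in are assumptions for this sketch (6 is, as far as I know, the ActiveMQ client default for maximumRedeliveries; a different RedeliveryPolicy changes the numbers).

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Toy model only: no JMS API and no broker are involved.
class RecoverLoopModel {

    /**
     * Simulates consumers that never acknowledge and always call recover().
     * Returns the total number of deliveries made before the queue is empty;
     * the IDs of dead-lettered messages are appended to dlq.
     */
    static long drainToDlq(int messageCount, int maxRedeliveries, List<Integer> dlq) {
        // Each entry is {messageId, redeliveryCount}.
        Deque<int[]> queue = new ArrayDeque<>();
        for (int id = 0; id < messageCount; id++) {
            queue.add(new int[] {id, 0});
        }
        long deliveries = 0;
        while (!queue.isEmpty()) {
            int[] msg = queue.poll();
            deliveries++;            // delivered to a consumer, never acknowledged
            msg[1]++;                // recover() makes the next delivery a redelivery
            if (msg[1] > maxRedeliveries) {
                dlq.add(msg[0]);     // limit exceeded: message is dead-lettered
            } else {
                queue.add(msg);      // back on the queue for another attempt
            }
        }
        return deliveries;
    }

    public static void main(String[] args) {
        List<Integer> dlq = new ArrayList<>();
        long deliveries = drainToDlq(9000, 6, dlq);
        System.out.println(deliveries + " deliveries, " + dlq.size() + " messages in DLQ");
    }
}
```

Under this model each message is delivered 7 times (one initial delivery plus 6 redeliveries), so 9000 messages mean 63000 deliveries per pass, which is the kind of store load each repetition generates.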

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

In a 3-node network of brokers with persistence on network storage, failed store operations can corrupt the KahaDB index. With NFS, for example, setting the timeo mount option to the recommended value of 20 (timeo is specified in tenths of a second, so 20 corresponds to 2 seconds) can result in Input/Output errors under load, which cause a broker restart. Occasionally the corruption prevents the broker from passing journal checks on startup, and the index files must be removed before it can recover.

Sometimes this also appears to produce erroneous message counts and duplicate messages within the network of brokers (NOB). For example, the broker reports fewer messages in the queue counts than actually exist in the journals. When the index is removed and rebuilt, the counts go up, and extra or duplicate messages are sometimes observed.
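
One way to check for the duplicates described here is to collect the message IDs from the queue after the index rebuild and look for repeats. The sketch below assumes that duplicates share the same JMSMessageID, which may not hold in every case. The class and method names are hypothetical, the IDs in main are made up, and collecting the IDs (for example with a QueueBrowser or over JMX) is left out.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for comparing queue contents before and after an index rebuild.
class QueueCountCheck {

    /** Returns the message IDs that occur more than once, in the order they were first repeated. */
    static Set<String> findDuplicates(List<String> messageIds) {
        Set<String> seen = new HashSet<>();
        Set<String> duplicates = new LinkedHashSet<>();
        for (String id : messageIds) {
            if (!seen.add(id)) {     // add() returns false if the ID was already present
                duplicates.add(id);
            }
        }
        return duplicates;
    }

    /** Positive result: the rebuilt index reports more messages than the old index did. */
    static int countDrift(int countBeforeRebuild, int countAfterRebuild) {
        return countAfterRebuild - countBeforeRebuild;
    }

    public static void main(String[] args) {
        List<String> ids = Arrays.asList("ID:a-1", "ID:b-1", "ID:a-1", "ID:c-1");
        System.out.println("duplicates: " + findDuplicates(ids));
        System.out.println("drift: " + countDrift(8990, 9004));
    }
}
```

Recording the queue count before removing the index and again after the rebuild gives the drift; any non-empty duplicate set is worth attaching to the case along with the journal trace logs.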

Attachments:
amq.log.gz (541 kB, 2020/08/29 7:25 PM)
amq.log.tar.gz (1.52 MB, 2020/07/01 4:26 PM)
amq.log.tar.gz (407 kB, 2020/06/27 7:08 PM)
logs.630446.tar.gz (4.16 MB, 2020/07/03 6:39 PM)
logs.journal.trace.tar.gz (1.00 MB, 2020/07/09 11:57 AM)
logs.tar.gz (10.72 MB, 2020/07/08 11:58 PM)

Assignee: Gary Tully

Reporter: Duane Hawkins

Votes: 0

Watchers: 3

Created: 2020/06/27 7:08 PM

Updated: 2024/03/25 6:52 PM

Resolved: 2021/06/18 12:35 PM
