Type: Bug
Resolution: Not a Bug
Priority: Critical
Fix Version/s: None
Affects Version/s: AMQ 7.3.0.GA, AMQ 7.2.4.GA, AMQ 7.4.1.GA
Component/s: clustering, high-availability
Labels: None

GSS Priority:
Workaround: Workaround Exists
Workaround Description:
The workaround is to add a short delay (30 seconds in this example) before the service starts, using an ExecStartPre line in the [Service] section of the systemd service file:

[Service]
ExecStartPre=/bin/sleep 30
Steps to Reproduce:


I used the attached 3-node (6-broker) configuration to reproduce the issue, along with the attached Camel application to generate load on the cluster.

1. Install and configure 6 brokers using the attached configuration (I installed a master and a slave on each of 3 virtual hosts).

2. Start the brokers.

3. Build and start the attached Camel Spring Boot example to push messages to the cluster.

4. Wait some time for a few thousand messages to accumulate on the broker.

5. Find the broker that has the local vs. SAF consumer (the one with the TEST.QUEUE.1-3 addresses).

6. Stop and quickly restart this broker.

7. Tail the other broker logs in the cluster and look for repeated duplicate node warnings.
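For step 7, a small script can make "repeated" easier to judge than reading the tail by eye. The following is only a minimal sketch: it assumes each warning is on a single line containing the code AMQ212034 and a trailing nodeID=... field, as in the log line quoted in this issue. The function names and the threshold parameter are hypothetical and not part of the broker or the attached reproducer.

```python
import re
from collections import Counter

# Warning code and nodeID field, as they appear in the log line quoted in this issue.
WARNING_CODE = "AMQ212034"
NODE_ID_RE = re.compile(r"nodeID=([0-9a-fA-F-]+)")


def count_duplicate_node_warnings(lines):
    """Return a Counter mapping nodeID -> number of AMQ212034 warnings seen."""
    counts = Counter()
    for line in lines:
        if WARNING_CODE not in line:
            continue
        match = NODE_ID_RE.search(line)
        # Fall back to a placeholder key when the line carries no nodeID field.
        counts[match.group(1) if match else "unknown"] += 1
    return counts


def repeated_node_ids(counts, threshold=1):
    """Return node IDs warned about more than `threshold` times.

    The default of 1 is an assumption based on the warning text, which says
    a single occurrence per node after a restart can be ignored.
    """
    return sorted(node for node, n in counts.items() if n > threshold)
```

One way to use it is to pass an open log file, for example `count_duplicate_node_warnings(open(path))` with `path` pointing at a broker's artemis.log, and then inspect `repeated_node_ids(...)` on the result. A node ID that keeps accumulating warnings after the restarted broker has rejoined matches the behaviour described in this issue.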


SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

When a master node is restarted under load in a cluster of three master/slave pairs configured for UDP multicast discovery, all cluster nodes begin issuing duplicate node ID warnings and do not recover after the restarted node rejoins the cluster.

2019-05-25 18:57:26,451 WARN  [org.apache.activemq.artemis.core.client] AMQ212034: There are more than one servers on the network broadcasting the same node id. You will see this message exactly once (per node) if a node is restarted, in which case it can be safely ignored. But if it is logged continuously it means you really do have more than one node on the same network active concurrently with the same node id. This could occur if you have a backup node active at the same time as its live node. nodeID=be93b547-7f3b-11e9-b540-080027990b69

In testing, this seems to happen most often when the restarted node is the one to which the load is attached and some backlog has been allowed to develop in the addresses/queues.


Attachments:
artemis.log_failed_host-10-0-132-24 (44 kB, 2019/06/18 10:37 AM)
artemis.log_failed_host-10-0-133-83 (55 kB, 2019/06/18 10:37 AM)
logs.zip (4 kB, 2019/05/28 4:41 AM)
reproducer.tar.gz (24 kB, 2019/05/25 7:06 PM)

relates to:
ENTMQBR-2805 HA Paused master broker is unable to take full control from live slave after sigcont (Closed)
ENTMQBR-3204 AMQ Broker start-up issue when there is huge pile up of messages (Closed)

Assignee: Francesco Nigro
Reporter: Duane Hawkins
Votes: 0
Watchers: 6
Created: 2019/05/25 7:04 PM
Updated: 2022/12/19 9:39 AM
Resolved: 2019/09/13 5:48 AM
