Loading...

XML

Word

Printable

Type: Bug
Resolution: Obsolete
Priority: Critical
Fix Version/s: None
Affects Version/s: AMQ 7.4.0.GA
Component/s: clustering, high-availability
Labels:
- downstream-verification-needed
Environment:

3 pairs HA same deployment like in ~~ENTMQBR-2476~~

Release Note Text:

Hide
In a cluster of three or more live-backup groups that is using the replication high availability (HA) policy, the live broker shuts down when its replication connection fails. However, when the replication connection is restored and the original live broker is restarted, the broker is sometimes unable to rejoin the broker cluster. To enable the original live broker to rejoin the cluster, first stop the new live (original backup) broker, restart the original live broker, and then restart the original backup broker.

Show
In a cluster of three or more live-backup groups that is using the replication high availability (HA) policy, the live broker shuts down when its replication connection fails. However, when the replication connection is restored and the original live broker is restarted, the broker is sometimes unable to rejoin the broker cluster. To enable the original live broker to rejoin the cluster, first stop the new live (original backup) broker, restart the original live broker, and then restart the original backup broker.
Release Note Status:
Documented as Known Issue
Target Release:

AMQ 7.5.0.GA
Steps to Reproduce:
Hide

Deploy HA 3 master slaves

Isolate 1 master (firewall rules)

Make sure slave takes control and master goes down

disable all firewall rules (restore connection)

observe that master is unable to join the cluster
Show
Deploy HA 3 master slaves Isolate 1 master (firewall rules) Make sure slave takes control and master goes down disable all firewall rules (restore connection) observe that master is unable to join the cluster

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Once isolated broker is ressurected it can't join the original cluster, thus it has overall no effect that it went down. Seems like whole cluster of brokers needs to be restarted.

There is a usual message on all brokers like

2019-07-12 13:19:35,135 WARN  [org.apache.activemq.artemis.core.client] AMQ212034: There are more than one servers on the network broadcasting the same node id. You will see this message exactly once (per node) if a node is restarted, in which case it can be safely ignored. But if it is logged continuously it means you really do have more than one node on the same network active concurrently with the same node id. This could occur if you have a backup node active at the same time as its live node. nodeID=fae12f12-a493-11e9-89d6-fa163ec19b2d

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

masterB_topology_view.png
2019/07/15 6:27 AM
27 kB
Michal Toth
alive_masters_topology_view.png
2019/07/15 6:27 AM
23 kB
Michal Toth

relates to

ENTMQBR-2476 Live server does not shutdown when using vote-on-replication-failure

Closed

mentioned in: Page Loading...

Assignee:: Andy Taylor

Reporter:: Michal Toth

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2019/07/12 7:21 AM

Updated:: 2022/09/09 7:10 AM

Resolved:: 2020/11/04 4:26 AM

Details

Description

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide