[
https://issues.apache.org/jira/browse/ARTEMIS-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Francesco Nigro updated ARTEMIS-3345:
-------------------------------------
Description:
A failing-back master forgets its Node ID and, on broker restart, having a
different Node ID, it can become live without checking for any existing live
broker with its old Node ID.
This happens because of the following mechanics in SharedNothingBackupActivation:
# {{SharedNothingBackupActivation::init}} calls
{{activeMQServer.resetNodeManager}}, which re-creates a NodeManager with
replicatingBackup == true
# {{SharedNothingBackupActivation::run}} then executes:
{code:java}
// move all data away:
activeMQServer.getNodeManager().stop();
activeMQServer.moveServerData(replicaPolicy.getMaxSavedReplicatedJournalsSize());
activeMQServer.getNodeManager().start();
{code}
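The stop / move / start sequence above can be illustrated with a toy model. This is only a sketch under a stated assumption: that the Node ID is persisted in a file living alongside the server data being moved away. {{ToyNodeManager}}, {{node.id}} and {{oldreplica.1}} are hypothetical names chosen for illustration and are not the Artemis classes or file layout.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.UUID;

// Toy model only: NOT the Artemis NodeManager. All names here are hypothetical.
final class NodeIdLossSketch {

    /** Persists its Node ID in a file under the data directory (assumption). */
    static final class ToyNodeManager {
        private final Path idFile;
        private String nodeId;

        ToyNodeManager(Path dataDir) {
            this.idFile = dataDir.resolve("node.id");
        }

        /** Reuses the persisted Node ID if present, otherwise generates a new one. */
        void start() throws IOException {
            if (Files.exists(idFile)) {
                nodeId = new String(Files.readAllBytes(idFile), StandardCharsets.UTF_8);
            } else {
                Files.createDirectories(idFile.getParent());
                nodeId = UUID.randomUUID().toString();
                Files.write(idFile, nodeId.getBytes(StandardCharsets.UTF_8));
            }
        }

        /** Drops the in-memory Node ID; only the file on disk remembers it. */
        void stop() {
            nodeId = null;
        }

        String getNodeId() {
            return nodeId;
        }
    }

    /** Runs stop / move data away / start and returns {idBeforeMove, idAfterMove}. */
    static String[] restartAroundDataMove() {
        try {
            Path root = Files.createTempDirectory("toy-broker");
            Path dataDir = root.resolve("data");
            ToyNodeManager manager = new ToyNodeManager(dataDir);
            manager.start();
            String before = manager.getNodeId();

            manager.stop();
            // Stand-in for moveServerData(...): the old data, Node ID file included, is moved aside.
            Files.move(dataDir, root.resolve("oldreplica.1"));
            manager.start(); // finds no Node ID file, so it generates a fresh one

            return new String[] {before, manager.getNodeId()};
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String[] ids = restartAroundDataMove();
        System.out.println("before move: " + ids[0]);
        System.out.println("after move:  " + ids[1]);
        System.out.println("same id? " + ids[0].equals(ids[1]));
    }
}
```

In this model the second {{start()}} sees an empty data directory and creates a new identity, which mirrors the reported symptom: after a restart the broker no longer knows its old Node ID and so has nothing to check an existing live against.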
was:
A failing-back master forgets its Node ID if any error happens on failover and
restarts as an empty backup.
On broker restart, it gets a different Node ID and becomes live.
> Shared-Nothing Replication Master loses its Node ID on failed fail-back
> -----------------------------------------------------------------------
>
> Key: ARTEMIS-3345
> URL: https://issues.apache.org/jira/browse/ARTEMIS-3345
> Project: ActiveMQ Artemis
> Issue Type: Bug
> Reporter: Francesco Nigro
> Assignee: Francesco Nigro
> Priority: Major
>
> A failing-back master forgets its Node ID and, on broker restart, having a
> different Node ID, it can become live without checking for any existing live
> broker with its old Node ID.
> This happens because of the following mechanics in SharedNothingBackupActivation:
> # {{SharedNothingBackupActivation::init}} calls
> {{activeMQServer.resetNodeManager}}, which re-creates a NodeManager with
> replicatingBackup == true
> # {{SharedNothingBackupActivation::run}} then executes:
> {code:java}
> // move all data away:
> activeMQServer.getNodeManager().stop();
> activeMQServer.moveServerData(replicaPolicy.getMaxSavedReplicatedJournalsSize());
> activeMQServer.getNodeManager().start();
> {code}