[
https://issues.apache.org/jira/browse/NIFI-2292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15392450#comment-15392450
]
ASF GitHub Bot commented on NIFI-2292:
--------------------------------------
GitHub user markap14 opened a pull request:
https://github.com/apache/nifi/pull/717
NIFI-2292: Funnel all cluster node status changes through the cluster
coordinator…
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/markap14/nifi NIFI-2292-rebased
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nifi/pull/717.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #717
----
commit 15449162f55acc110bcf67d409738b68e3a37801
Author: Mark Payne <[email protected]>
Date: 2016-07-25T17:41:48Z
NIFI-2292: Funnel all cluster node status changes through the cluster
coordinator instead of having each node broadcast changes to the whole cluster.
This gives us the ability to increment the updateId consistently without race
conditions.
commit 0edda2187d0055ebd5a52401f86e1773e37d802c
Author: Mark Payne <[email protected]>
Date: 2016-07-21T18:30:08Z
NIFI-2292: Funnel all cluster node status changes through the cluster
coordinator instead of having each node broadcast changes to the whole cluster.
This gives us the ability to increment the updateId consistently without race
conditions.
----
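For readers following the commit message above, here is a minimal sketch of why routing every status change through one coordinator makes the updateId safe to increment. It is an illustration under stated assumptions, not the code in this pull request: the class names (StatusCoordinatorSketch, Coordinator, NodeView, NodeStatus), the State enum and the method names are all hypothetical and are not taken from NiFi.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical sketch; none of these names are NiFi's actual classes. */
public class StatusCoordinatorSketch {

    enum State { CONNECTING, CONNECTED, DISCONNECTED }

    /** Immutable status record stamped with a coordinator-assigned update id. */
    static final class NodeStatus {
        final String nodeId;
        final State state;
        final long updateId;

        NodeStatus(String nodeId, State state, long updateId) {
            this.nodeId = nodeId;
            this.state = state;
            this.updateId = updateId;
        }
    }

    /** Lives only on the coordinator: the single place where update ids are assigned. */
    static final class Coordinator {
        private final AtomicLong updateIdSource = new AtomicLong(0);

        /** Nodes send their requested change here instead of broadcasting it themselves. */
        NodeStatus requestStatusChange(String nodeId, State newState) {
            // One counter on one node, so ids are unique and strictly increasing.
            return new NodeStatus(nodeId, newState, updateIdSource.incrementAndGet());
        }
    }

    /** A node's local view of the cluster: keeps only the newest status per node. */
    static final class NodeView {
        private final Map<String, NodeStatus> view = new ConcurrentHashMap<>();

        /** Returns true if the given update is the current status after the call. */
        boolean apply(NodeStatus update) {
            NodeStatus current = view.merge(update.nodeId, update,
                    (old, incoming) -> incoming.updateId > old.updateId ? incoming : old);
            return current == update;
        }

        State stateOf(String nodeId) {
            NodeStatus status = view.get(nodeId);
            return status == null ? null : status.state;
        }
    }

    public static void main(String[] args) {
        Coordinator coordinator = new Coordinator();
        NodeView view = new NodeView();

        NodeStatus connecting = coordinator.requestStatusChange("node-1", State.CONNECTING);
        NodeStatus connected = coordinator.requestStatusChange("node-1", State.CONNECTED);

        // Deliver the two updates out of order; the stale one is ignored.
        view.apply(connected);
        view.apply(connecting);

        System.out.println(view.stateOf("node-1")); // prints CONNECTED
    }
}
```

Because only the coordinator increments the counter, two nodes can never stamp different changes with the same or conflicting ids, and any receiver can discard an update that is older than the one it already holds. With per-node broadcasting, each sender would need its own counter, and receivers would have no reliable way to order competing updates.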
> Nodes in cluster sometimes become out-of-sync with actual 'connection state'
> of node
> ------------------------------------------------------------------------------------
>
> Key: NIFI-2292
> URL: https://issues.apache.org/jira/browse/NIFI-2292
> Project: Apache NiFi
> Issue Type: Bug
> Components: Core Framework
> Reporter: Mark Payne
> Assignee: Mark Payne
> Fix For: 1.0.0
>
>
> Occasionally I'll see a node that has a different view of the cluster than
> other nodes. Right now I'm actually seeing "node 1" think it's in
> 'CONNECTING' state while nodes 2-5 think we have 5/5 nodes connected.
> This can also result in a node that is elected cluster coordinator and then
> has that role revoked continuing to monitor for heartbeats, even though it
> won't receive them since it's not the coordinator anymore. This results in
> continually logging a message like "Failed to retrieve any new heartbeat
> information for nodes. Will not make any decisions based on heartbeats."