[ 
https://issues.apache.org/jira/browse/NIFI-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matt Gilman reassigned NIFI-1457:
---------------------------------

    Assignee: Matt Gilman

> excessive bulletins and logging when primary node is revoked by NCM.
> --------------------------------------------------------------------
>
>                 Key: NIFI-1457
>                 URL: https://issues.apache.org/jira/browse/NIFI-1457
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 0.4.1
>         Environment: centOS 7
>            Reporter: Matthew Clarke
>            Assignee: Matt Gilman
>            Priority: Minor
>             Fix For: 0.5.0
>
>
> I have a 3 node cluster up and running. The current primary node loses 
> connectivity to NCM and eventually becomes disconnected by NCM because of 
> lack of heartbeat.  From NCM, the disconnected node is dropped from cluster 
> and a new primary node is manually elected. When network comms are restored 
> between original primary node and NCM, the NCM receives heartbeat messages 
> once again that claim to be from the primary node. The NCM correctly captures 
> this and revokes that nodes primary role status. The problem is that the 
> bulletins stating that the role has been revoked never stop being produced 
> because the original node heartbeats still claim to be from the primary node.
> On original Primary Node I see these messages constantly:2016-02-01 
> 16:05:39,635 INFO [Process NCM Request-2] 
> o.a.n.c.p.impl.SocketProtocolListener Received request 
> c5f13d5b-0f09-4fe2-885e-d2d597339491 from ec2-x.x.x.x.compute.amazonaws.com
> On NCM I see these messages constantly in app log:
> 2016-02-01 11:08:25,636 INFO [Process Pending Heartbeats] 
> o.a.n.c.manager.impl.WebClusterManager Node Event: 
> [id=b996c3c0-996c-4072-ba5a-434294d72036, 
> apiAddress=ec2-x.x.x.x.compute.amazonaws.com, apiPort=8443, 
> socketAddress=ec2-x.x.x.x.compute.amazonaws.com, socketPort=10000] -- 
> 'Heartbeat indicates node is running as primary node.  Revoking primary role 
> because primary role is assigned to a different node.'
> The original primary Node is no longer running and "on primary node" 
> processors; however, it appears the heartbeat message is not being updated to 
> reflect that it is not still the primary node.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to