Joe,

Our current dev/cert/prod flows run on NiFi 1.2.0 and this is where we face 
this issue a lot (hence why we created the first version of the monitoring 
flow). We have stood up new clusters which are running NiFi 1.8.0. No 
users/flows are running on the new 1.8.0 clusters besides our Admin monitoring 
flows. So no, we don’t face this issue on 1.8.0, but we also have no users yet. 
We are opening up the 1.8.0 clusters to our users this week for them to start 
porting over their existing flows to the 1.8.0 clusters.

-Chad

From: Joe Witt <[email protected]>
Reply-To: "[email protected]" <[email protected]>
Date: Monday, December 10, 2018 at 10:23 AM
To: "[email protected]" <[email protected]>
Subject: Re: How to get Node ID of disconnected node to be used in a 
cluster/node monitoring flow

Chad,

In 1.8.0 you should not see much need to script that. Are you still seeing 
disconnects in 1.8.0 that it doesn't restore on its own?

Thanks

On Mon, Dec 10, 2018, 10:08 AM Woodhead, Chad 
<[email protected]<mailto:[email protected]>> wrote:
Hey Jeremy!

We occasionally see a node disconnect from the cluster (not disappear from the 
cluster) so I’m building this flow just to reconnect a disconnected node. The 
cluster still shows the node, but just that it is in a disconnected state.

As far as the API call to /nifi-api/controller/cluster, yes I get the same 
result if I call the API on one of the connected nodes. If I call the API of a 
disconnected node, I get “Only a node connected to a cluster can process the 
request.” And since the flowfile is sometimes processed on the disconnected 
node, using ${hostname(true)} in the API call to /nifi-api/controller/cluster 
won’t work.
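
One possible workaround, sketched here in Python rather than as a NiFi flow, is to try the cluster endpoint on each known host in turn and use the first one that answers successfully. This is only a sketch: the host names, the `fetch` callable, and the 409 status used in the fake are all hypothetical, and it assumes a disconnected node answers with a non-200 status.

```python
def first_cluster_view(hosts, fetch):
    """Return (host, body) from the first host that answers with HTTP 200.

    `fetch` is a hypothetical callable taking a host name and returning
    (status_code, body) for GET /nifi-api/controller/cluster on that host.
    Returns None if no host gives a usable answer.
    """
    for host in hosts:
        try:
            status, body = fetch(host)
        except OSError:
            continue  # host unreachable; try the next one
        if status == 200:
            return host, body
        # Any other status (for example, a disconnected node rejecting
        # the request) means this host cannot describe the cluster.
    return None


def fake_fetch(host):
    """Stand-in for a real HTTP call; the status codes are illustrative."""
    if host == "nifi-a.example.com":
        return 409, "Only a node connected to a cluster can process the request."
    return 200, '{"cluster": {"nodes": []}}'


result = first_cluster_view(
    ["nifi-a.example.com", "nifi-b.example.com"], fake_fetch
)
print(result)
```

Injecting `fetch` keeps the fallback logic separate from the HTTP client, so the same loop can be exercised without a running cluster.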

For reference, I am also using NiFi 1.8.0.

-Chad

From: Jeremy Dyer <[email protected]<mailto:[email protected]>>
Reply-To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Date: Monday, December 10, 2018 at 9:12 AM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Subject: Re: How to get Node ID of disconnected node to be used in a 
cluster/node monitoring flow

Chad - The API call to /nifi-api/controller/cluster will return all of the 
nodes even if they are disconnected. The JSON response will indicate which node 
is "disconnected". I have included a screenshot of a simple environment I set 
up to test this. However, since you are monitoring for this, I suspect there 
may be a deeper issue you are trying to catch. Are you seeing nodes 
completely disappear from the cluster rather than just sit in the 
"disconnected" state? If so, that is a different issue, but if they are just 
"disconnected" you should be able to see that, as the screenshot below 
illustrates. For reference, I am using NiFi 1.8.0.
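
As a rough illustration of reading that response, the Python sketch below picks out the disconnected nodes from the JSON. It assumes the body has the shape `{"cluster": {"nodes": [...]}}` with `nodeId`, `address`, and `status` fields on each node, and that the status value is the string `DISCONNECTED`; the sample data is made up, so check the field names against a real response.

```python
import json


def disconnected_nodes(cluster_json):
    """Return nodeId and address for each node reported as DISCONNECTED.

    Assumes the response shape {"cluster": {"nodes": [...]}}; the field
    names here are an assumption to verify against an actual response.
    """
    nodes = json.loads(cluster_json).get("cluster", {}).get("nodes", [])
    return [
        {"nodeId": node.get("nodeId"), "address": node.get("address")}
        for node in nodes
        if node.get("status") == "DISCONNECTED"
    ]


# Made-up sample standing in for a GET /nifi-api/controller/cluster body.
sample = json.dumps({
    "cluster": {
        "nodes": [
            {"nodeId": "id-a", "address": "nifi-a.example.com",
             "status": "CONNECTED"},
            {"nodeId": "id-b", "address": "nifi-b.example.com",
             "status": "DISCONNECTED"},
        ]
    }
})

print(disconnected_nodes(sample))
```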

On Mon, Dec 10, 2018 at 8:47 AM Woodhead, Chad 
<[email protected]<mailto:[email protected]>> wrote:
I am working on a cluster/node monitoring flow and was looking for some 
guidance.

I currently have a working flow that monitors if a node gets disconnected from 
the cluster and sends an email if a node disconnects.

Here is the flow:
(A SiteToSiteBulletinReportingTask sends to the ClusterMonitoring input port)

The CheckAPIForDisconnectedNode process group makes an API call to NiFi to 
confirm the node is still disconnected before sending the email.

This flow works as expected. I now want to update the flow to automatically 
attempt to reconnect the node. When a node disconnects, I capture the server 
name of that node. To reconnect a node, you can call the API with 
PUT /nifi-api/controller/cluster/nodes/{id}, but the problem is I don’t know 
the Node ID. The only place I have found to get the Node ID is 
GET /nifi-api/controller/cluster, but that API only works on a node that is 
connected to the cluster, and the way this flow works, the processing can and 
sometimes does run on the node that has been disconnected. Is there a way to 
force the processing to run only on a cluster-connected node and then use 
${hostname(true)} in the API call, or is there another way I can get the Node 
ID, even on a disconnected node? I don’t want to have to hard-code all of my 
Node IDs somewhere.
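
To make the two steps concrete, here is a minimal Python sketch of what I am after: look up the Node ID by server name in the cluster response, then build the reconnect PUT. The response field names (`cluster.nodes[].address`, `nodeId`), the request body with `"status": "CONNECTING"`, the base URL, and the sample data are all assumptions for illustration. The request is only built here, not sent.

```python
import json
import urllib.request


def find_node_id(cluster, hostname):
    """Return the nodeId whose address matches hostname, or None.

    `cluster` is the parsed GET /nifi-api/controller/cluster response;
    the field names are assumed, not confirmed.
    """
    for node in cluster.get("cluster", {}).get("nodes", []):
        if (node.get("address") or "").lower() == hostname.lower():
            return node.get("nodeId")
    return None


def build_reconnect_request(base_url, node_id):
    """Build (but do not send) PUT /nifi-api/controller/cluster/nodes/{id}.

    The body asking for status CONNECTING is an assumption about what
    the endpoint expects for a reconnect.
    """
    body = json.dumps(
        {"node": {"nodeId": node_id, "status": "CONNECTING"}}
    ).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/nifi-api/controller/cluster/nodes/{node_id}",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


# Made-up cluster view; base_url must point at a node that is connected.
cluster_view = {
    "cluster": {
        "nodes": [
            {"nodeId": "id-a", "address": "nifi-a.example.com",
             "status": "CONNECTED"},
            {"nodeId": "id-b", "address": "nifi-b.example.com",
             "status": "DISCONNECTED"},
        ]
    }
}

node_id = find_node_id(cluster_view, "nifi-b.example.com")
request = build_reconnect_request("http://nifi-a.example.com:8080", node_id)
print(request.get_method(), request.full_url)
```

The lookup still depends on reaching a connected node first, which is the part I have not solved.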

Thanks,
Chad Woodhead


