I had someone from Hortonworks suggest to me that I should also set any PutSQL 
processors to only execute on primary. The reasoning was due to flooding of the 
JDBC pool.

> On 9 Jan 2018, at 17:25, Joe Witt <joe.w...@gmail.com> wrote:
> 
> I'd avoid setting any processor to primary node only unless it is a
> source processor (something that brings data into the system).
> 
> But, yes, I believe your description is accurate as of now.
> 
> Thanks
> 
>> On Mon, Jan 8, 2018 at 11:21 PM, 尹文才 <batman...@gmail.com> wrote:
>> Thanks Joe, so you mean for example, if I set one processor to run only on
>> primary node in the cluster and there're 100 FlowFiles in the incoming
>> queue of the processor
>> waiting to be processed by this processor, and the processor suddenly goes
>> down and then another node is elected as the primary node, those 100
>> FlowFiles will be kept locally
>> in the node that went down and will continue to be processed by the node
>> when it goes back online, these FlowFiles will not be available to the new
>> primary node and other nodes,
>> am I correct?
>> 
>> Regards,
>> Ben
>> 
>> 
>> 2018-01-09 14:08 GMT+08:00 Joe Witt <joe.w...@gmail.com>:
>> 
>>> Ben,
>>> 
>>> Data already mid-flow within a node will be kept on the node and
>>> processed when the node is back on-line.  All other data coming into
>>> the cluster can fail-over to other nodes provided you're sourcing data
>>> with queuing semantics or automated load balancing or fail-over as-is
>>> present in the Apache NiFi Site to Site protocol.
>>> 
>>> Thanks
>>> Joe
>>> 
>>>> On Mon, Jan 8, 2018 at 11:05 PM, 尹文才 <batman...@gmail.com> wrote:
>>>> Hi guys, I have a question about data HA when NiFi is run in clustered
>>>> mode, if one node goes down, will the flowfiles owned by this node taken
>>>> over and processed by another node?
>>>> Or will the flowfiles be kept locally to that node and will only be
>>>> processed when that node is back online? Thanks.
>>>> 
>>>> Regards,
>>>> Ben
>>> 

Reply via email to