[jira] [Commented] (NIFI-11450) Node respecting nifi.flowcontroller.autoResumeState when joining cluster

Nissim Shiman (Jira) Fri, 02 May 2025 09:00:05 -0700


    [ https://issues.apache.org/jira/browse/NIFI-11450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17948934#comment-17948934 ]


Nissim Shiman commented on NIFI-11450:
--------------------------------------

This issue is still reproducible on 2.4.0.
To reproduce, set up a three-node cluster with a single processor running.
On one node, set nifi.flowcontroller.autoResumeState to false.

*1.* If that node is stopped, its flow.json.gz removed, and the node started 
again, the processor will be in the ENABLED state on this node only (it 
remains RUNNING on the other nodes).
This is the case described in this ticket.

*2.* If, instead, the node is restarted (i.e. keeping flow.json.gz), the 
processor will usually also be in the ENABLED state on this node only, but 
sometimes it will be RUNNING (i.e. despite the local autoResumeState setting 
of false).
It comes up ENABLED on roughly 70% of restarts.
On the other nodes it remains RUNNING.

*3.* If, instead, the node is disconnected and reconnected (via the GUI), the 
processor will be in the RUNNING state (i.e. it ignores the local 
autoResumeState setting of false).

*4.* In a worst-case scenario, if the node with autoResumeState set to false 
comes up with its processors ENABLED, then becomes the primary node, and 
the other nodes are disconnected and reconnected, 
processors will be ENABLED on all nodes of the cluster, despite 
autoResumeState being true on all but one node.

All of these are edge cases, since users should set 
nifi.flowcontroller.autoResumeState consistently on all nodes, but what should 
the behavior be when the values differ?
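
As a rough illustration of the consistency point, here is a minimal Python 
sketch that compares the nifi.flowcontroller.autoResumeState value across the 
nifi.properties contents of several nodes. The helper names (read_property, 
find_mismatch) and the node names are hypothetical, the parsing is a 
simplified take on the .properties format, and none of this is part of NiFi.

```python
from typing import Dict, Optional

KEY = "nifi.flowcontroller.autoResumeState"


def read_property(properties_text: str, key: str = KEY) -> Optional[str]:
    """Return the last value assigned to key (lowercased), or None if absent."""
    value = None
    for raw_line in properties_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments (simplified .properties parsing).
        if not line or line.startswith(("#", "!")):
            continue
        name, sep, rest = line.partition("=")
        if sep and name.strip() == key:
            value = rest.strip().lower()
    return value


def find_mismatch(node_properties: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Return node -> value if the nodes disagree, else an empty dict."""
    values = {node: read_property(text) for node, text in node_properties.items()}
    return values if len(set(values.values())) > 1 else {}


if __name__ == "__main__":
    # Hypothetical three-node cluster where one node differs, as in the test setup.
    cluster = {
        "node1": "nifi.flowcontroller.autoResumeState=true\n",
        "node2": "nifi.flowcontroller.autoResumeState=true\n",
        "node3": "nifi.flowcontroller.autoResumeState=false\n",
    }
    print(find_mismatch(cluster))
```

A check like this only flags the inconsistency up front; it does not answer 
what the cluster should do when the values differ.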

[~markap14] and/or [~exceptionfactory], would you have any thoughts on the 
case where nifi.flowcontroller.autoResumeState is not set the same on all 
nodes?

> Node respecting nifi.flowcontroller.autoResumeState when joining cluster
> ------------------------------------------------------------------------
>
>                 Key: NIFI-11450
>                 URL: https://issues.apache.org/jira/browse/NIFI-11450
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Core Framework
>    Affects Versions: 1.19.1
>         Environment: Nifi version 1.19.1.  Linux.  JDK11+
>            Reporter: Phil Lord
>            Assignee: Nissim Shiman
>            Priority: Critical
>
> Mistakenly tried to connect a node to an active cluster that had the 
> following setting: nifi.flowcontroller.autoResumeState=true .
> The flow.xml/json were both removed completely prior to attempting to connect 
> node to the cluster, then started the node.  The node connected to the 
> cluster without issue, however all components/controllerServices on that node 
> were in a stopped state(due to the property mentioned above).
> Is this correct behavior given that the flow that was inherited from the 
> cluster had components started/etc.  I would think the node should either 
> fail to join the cluster/start?
> This was running Nifi 1.19.1... unclear if it's resolved in latest release.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
