[jira] [Commented] (NIFI-12221) Make heartbeat responses more lenient in some cases

ASF subversion and git services (Jira) Fri, 13 Oct 2023 14:00:04 -0700


    [ 
https://issues.apache.org/jira/browse/NIFI-12221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17775062#comment-17775062
 ]


ASF subversion and git services commented on NIFI-12221:
--------------------------------------------------------

Commit f4ae292a457638d3226fb0491e5186fa52ae8518 in nifi's branch 
refs/heads/main from Mark Payne
[ https://gitbox.apache.org/repos/asf?p=nifi.git;h=f4ae292a45 ]

NIFI-12221: This closes #7876. Be more lenient about which Disconnection Codes 
we allow a node to be reconnected to a cluster vs. when we notify the node to 
disconnect again. Also updated the timeout for OffloadIT because it 
occasionally times ou out while running properly.

Signed-off-by: Joseph Witt <joew...@apache.org>


> Make heartbeat responses more lenient in some cases
> ---------------------------------------------------
>
>                 Key: NIFI-12221
>                 URL: https://issues.apache.org/jira/browse/NIFI-12221
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Core Framework
>            Reporter: Mark Payne
>            Assignee: Mark Payne
>            Priority: Major
>             Fix For: 2.latest
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When a heartbeat is received by the Cluster Coordinator, it responds based on 
> the node's current connection state. In the case of a disconnected node, it 
> either notifies the node that it is disconnected so that it will stop 
> hearting, or it requests the node to reconnect to the cluster.
> Due to changes that were made in 1.16, as well as a few additional changes 
> that have been made since, we can be much more lenient about when we ask the 
> node to reconnect vs. disconnect. For example, if a node was disconnected due 
> to not handling an update request, we previously needed to request that the 
> node disconnect again. However, now we can ask the node to reconnect, as it 
> may well be able to reconcile any differences and rejoin.
> We even currently request that a node disconnect if receiving a heartbeat 
> from a node whose last state was "Disconnected because Node was Shutdown". We 
> should definitely be more lenient in this case, as it's occasionally causing 
> System Test failures (e.g., 
> [https://github.com/apache/nifi/actions/runs/6498488206).|https://github.com/apache/nifi/actions/runs/6498488206)]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (NIFI-12221) Make heartbeat responses more lenient in some cases

Reply via email to