[jira] [Commented] (NIFI-14125) Clearing state with Kubernetes state provider causes cluster nodes to reconnect

Peter Kimberley (Jira) Thu, 02 Jan 2025 04:06:10 -0800


    [ 
https://issues.apache.org/jira/browse/NIFI-14125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17909294#comment-17909294
 ]


Peter Kimberley commented on NIFI-14125:
----------------------------------------

After examining the logs further, I suspect what may be happening is the 
Fabric8 Kubernetes Client is returning HTTP status 409 (conflict) due to all 
nodes attempting to update the state _ConfigMap_ concurrently.

Perhaps only the coordinating node should be responsible for performing the 
update, where a cluster-scoped state is involved?

Relevant log entries follow. Coordinating node in this case is _test-nifi-1._

{{*test-nifi-0 (nifi-request.log):*}}
{{10.42.115.29 - test.user [02/Jan/2025:11:49:59 +0000] "POST 
/nifi-api/processors/1b5f4233-0194-1000-0000-00003610e403/state/clear-requests 
HTTP/2.0" *409* 149 "https://nifi.example.com/nifi/"; "Apache NiFi/2.1.0"}}

{{*test-nifi-1 (nifi-app.log):*}}
{{2025-01-02 22:49:59,291 WARN [Replicate Request Thread-176] 
o.a.n.c.c.node.NodeClusterCoordinator The following nodes failed to process URI 
/nifi-api/processors/1b5f4233-0194-1000-0000-00003610e403/state/clear-requests 
'[test-nifi-0.test-nifi.nifi:8443, test-nifi-2.test-nifi.nifi:8443]'.  
Requesting each node reconnect to cluster.}}

{{*test-nifi-2 (nifi-request.log):*}}
{{10.42.115.29 - test.user [02/Jan/2025:11:49:59 +0000] "POST 
/nifi-api/processors/1b5f4233-0194-1000-0000-00003610e403/state/clear-requests 
HTTP/2.0" *409* 149 "https://nifi.example.com/nifi/"; "Apache NiFi/2.1.0"}}

> Clearing state with Kubernetes state provider causes cluster nodes to 
> reconnect
> -------------------------------------------------------------------------------
>
>                 Key: NIFI-14125
>                 URL: https://issues.apache.org/jira/browse/NIFI-14125
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Core Framework
>    Affects Versions: 2.1.0
>         Environment: Kubernetes
>            Reporter: Peter Kimberley
>            Priority: Major
>         Attachments: image-2025-01-01-19-35-01-388.png
>
>
> When using the Kubernetes state provider implementation, clearing a 
> component's state via the UI results in the following error to be logged by 
> NiFi:
> {{2025-01-01 19:10:30,773 WARN [Replicate Request Thread-260] 
> o.a.n.c.c.node.NodeClusterCoordinator The following nodes failed to process 
> URI 
> /nifi-api/processors/1b5f4233-0194-1000-0000-00003610e403/state/clear-requests
>  '[test-nifi-2.test-nifi.nifi:8443, test-nifi-1.test-nifi.nifi:8443]'. 
> Requesting each node reconnect to cluster.}}
> Following this, the coordinating node is re-elected and all nodes reconnect, 
> as indicated by error bulletins in the UI:
> !image-2025-01-01-19-35-01-388.png!
> This issue is not present when using the Zookeeper state provider.
> Inspecting the browser's API request to the _clear-requests_ endpoint, the 
> request is reported as a success (status code 200).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (NIFI-14125) Clearing state with Kubernetes state provider causes cluster nodes to reconnect

Reply via email to