[ 
https://issues.apache.org/jira/browse/NIFI-12685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17812750#comment-17812750
 ] 

Giovanni edited comment on NIFI-12685 at 1/31/24 3:27 PM:
----------------------------------------------------------

After further analysis I found a recurring error on the primary node:
{code:java}
2024-01-31 15:14:21,158 ERROR [Load-Balanced Client Thread-3] 
o.a.n.c.q.c.c.a.n.NioAsyncLoadBalanceClient Unable to connect to 
tor1nifi1.skl.one:8443 for load balancing {code}
It was due to iptables that was blocking port 6342 
(nifi.cluster.load.balance.port) on every node. Adding the rule to iptables 
resolved the issue.

 

 


was (Author: JIRAUSER300198):
After further analysis I found a recurring error on the primary node:

2024-01-31 15:14:21,158 ERROR [Load-Balanced Client Thread-3] 
o.a.n.c.q.c.c.a.n.NioAsyncLoadBalanceClient Unable to connect to 
tor1nifi1.skl.one:8443 for load balancing

It was due to iptables that was blocking port 6342 
(nifi.cluster.load.balance.port) on every node. Adding the rule to iptables 
resolved the issue.



 

 

> NiFi Cluster - java.net.SocketTimeoutException: timeout 
> --------------------------------------------------------
>
>                 Key: NIFI-12685
>                 URL: https://issues.apache.org/jira/browse/NIFI-12685
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.22.0
>            Reporter: Giovanni
>            Priority: Blocker
>         Attachments: image-2024-01-30-10-19-30-651.png
>
>
> Hi,
> I have a 3 nodes cluster and every time I try to edit a workflow one of the 
> nodes goes timeout and UI return the following error:
> !image-2024-01-30-10-19-30-651.png!
>  
> At the same time one of the nodes reports the following log in the 
> nifi-app.log:
> {code:java}
> 2024-01-30 09:11:28,489 WARN [Replicate Request Thread-496] 
> o.a.n.c.c.h.r.ThreadPoolRequestReplicator 
> java.net.SocketTimeoutException: timeout
>         at okio.SocketAsyncTimeout.newTimeoutException(JvmOkio.kt:147)
>         at okio.AsyncTimeout.access$newTimeoutException(AsyncTimeout.kt:158)
>         at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:337)
>         at okio.RealBufferedSource.indexOf(RealBufferedSource.kt:427)
>         at 
> okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.kt:320)
>         at okhttp3.internal.http1.HeadersReader.readLine(HeadersReader.kt:29)
>         at 
> okhttp3.internal.http1.Http1ExchangeCodec.readResponseHeaders(Http1ExchangeCodec.kt:180)
>         at 
> okhttp3.internal.connection.Exchange.readResponseHeaders(Exchange.kt:110)
>         at 
> okhttp3.internal.http.CallServerInterceptor.intercept(CallServerInterceptor.kt:93)
>         at 
> okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>         at 
> okhttp3.internal.connection.ConnectInterceptor.intercept(ConnectInterceptor.kt:34)
>         at 
> okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>         at 
> okhttp3.internal.cache.CacheInterceptor.intercept(CacheInterceptor.kt:95)
>         at 
> okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>         at 
> okhttp3.internal.http.BridgeInterceptor.intercept(BridgeInterceptor.kt:83)
>         at 
> okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>         at 
> okhttp3.internal.http.RetryAndFollowUpInterceptor.intercept(RetryAndFollowUpInterceptor.kt:76)
>         at 
> okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>         at 
> okhttp3.internal.connection.RealCall.getResponseWithInterceptorChain$okhttp(RealCall.kt:201)
>         at okhttp3.internal.connection.RealCall.execute(RealCall.kt:154)
>         at 
> org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:136)
>         at 
> org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:130)
>         at 
> org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator.replicateRequest(ThreadPoolRequestReplicator.java:645)
>         at 
> org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator$NodeHttpRequest.run(ThreadPoolRequestReplicator.java:869)
>         at 
> java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>         at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>         at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>         at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>         at java.base/java.lang.Thread.run(Thread.java:829)
> Caused by: java.net.SocketException: Socket closed
>         at 
> java.base/java.net.SocketInputStream.read(SocketInputStream.java:183)
>         at 
> java.base/java.net.SocketInputStream.read(SocketInputStream.java:140)
>         at 
> java.base/sun.security.ssl.SSLSocketInputRecord.read(SSLSocketInputRecord.java:484)
>         at 
> java.base/sun.security.ssl.SSLSocketInputRecord.readHeader(SSLSocketInputRecord.java:478)
>         at 
> java.base/sun.security.ssl.SSLSocketInputRecord.bytesInCompletePacket(SSLSocketInputRecord.java:70)
>         at 
> java.base/sun.security.ssl.SSLSocketImpl.readApplicationRecord(SSLSocketImpl.java:1455)
>         at 
> java.base/sun.security.ssl.SSLSocketImpl$AppInputStream.read(SSLSocketImpl.java:1066)
>         at okio.InputStreamSource.read(JvmOkio.kt:94)
>         at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:125)
>         ... 26 common frames omitted {code}
> As I thought it was a performance issue, I doubled each nodes resources but 
> the error is still occurring.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to