[
https://issues.apache.org/jira/browse/NIFI-12685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Giovanni resolved NIFI-12685.
-----------------------------
Resolution: Not A Bug
After further analysis I found a recurring error on the primary node:
2024-01-31 15:14:21,158 ERROR [Load-Balanced Client Thread-3] o.a.n.c.q.c.c.a.n.NioAsyncLoadBalanceClient Unable to connect to tor1nifi1.skl.one:8443 for load balancing
The cause was iptables blocking port 6342 (nifi.cluster.load.balance.port)
on every node. Adding an iptables rule that allows traffic on that port
resolved the issue.
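
For anyone hitting the same symptom, a quick way to confirm whether a firewall is dropping cluster traffic is to attempt a plain TCP connection to each node on the relevant ports. The following is only a minimal sketch, not part of NiFi: the function names are hypothetical, and it assumes 8443 is the HTTPS port and 6342 the load balance port, as in this cluster.

```python
import socket


def port_is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts (dropped packets) and DNS errors.
        return False


def check_cluster(nodes, ports, timeout: float = 3.0) -> dict:
    """Map every (node, port) pair to its reachability as seen from this host."""
    return {
        (node, port): port_is_reachable(node, port, timeout)
        for node in nodes
        for port in ports
    }
```

Running something like `check_cluster(["tor1nifi1.skl.one"], [8443, 6342])` from each node, with the remaining node names filled in, shows which ports are unreachable. A port that is refused and one that is silently dropped by iptables both come back as False, so a False result only narrows the search and still needs to be checked against the firewall rules.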
> NiFi Cluster - java.net.SocketTimeoutException: timeout
> --------------------------------------------------------
>
> Key: NIFI-12685
> URL: https://issues.apache.org/jira/browse/NIFI-12685
> Project: Apache NiFi
> Issue Type: Bug
> Affects Versions: 1.22.0
> Reporter: Giovanni
> Priority: Blocker
> Attachments: image-2024-01-30-10-19-30-651.png
>
>
> Hi,
> I have a 3-node cluster, and every time I try to edit a workflow one of the
> nodes times out and the UI returns the following error:
> !image-2024-01-30-10-19-30-651.png!
>
> At the same time, one of the nodes logs the following in
> nifi-app.log:
> {code:java}
> 2024-01-30 09:11:28,489 WARN [Replicate Request Thread-496] o.a.n.c.c.h.r.ThreadPoolRequestReplicator
> java.net.SocketTimeoutException: timeout
> at okio.SocketAsyncTimeout.newTimeoutException(JvmOkio.kt:147)
> at okio.AsyncTimeout.access$newTimeoutException(AsyncTimeout.kt:158)
> at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:337)
> at okio.RealBufferedSource.indexOf(RealBufferedSource.kt:427)
> at okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.kt:320)
> at okhttp3.internal.http1.HeadersReader.readLine(HeadersReader.kt:29)
> at okhttp3.internal.http1.Http1ExchangeCodec.readResponseHeaders(Http1ExchangeCodec.kt:180)
> at okhttp3.internal.connection.Exchange.readResponseHeaders(Exchange.kt:110)
> at okhttp3.internal.http.CallServerInterceptor.intercept(CallServerInterceptor.kt:93)
> at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
> at okhttp3.internal.connection.ConnectInterceptor.intercept(ConnectInterceptor.kt:34)
> at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
> at okhttp3.internal.cache.CacheInterceptor.intercept(CacheInterceptor.kt:95)
> at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
> at okhttp3.internal.http.BridgeInterceptor.intercept(BridgeInterceptor.kt:83)
> at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
> at okhttp3.internal.http.RetryAndFollowUpInterceptor.intercept(RetryAndFollowUpInterceptor.kt:76)
> at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
> at okhttp3.internal.connection.RealCall.getResponseWithInterceptorChain$okhttp(RealCall.kt:201)
> at okhttp3.internal.connection.RealCall.execute(RealCall.kt:154)
> at org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:136)
> at org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:130)
> at org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator.replicateRequest(ThreadPoolRequestReplicator.java:645)
> at org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator$NodeHttpRequest.run(ThreadPoolRequestReplicator.java:869)
> at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
> at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> at java.base/java.lang.Thread.run(Thread.java:829)
> Caused by: java.net.SocketException: Socket closed
> at java.base/java.net.SocketInputStream.read(SocketInputStream.java:183)
> at java.base/java.net.SocketInputStream.read(SocketInputStream.java:140)
> at java.base/sun.security.ssl.SSLSocketInputRecord.read(SSLSocketInputRecord.java:484)
> at java.base/sun.security.ssl.SSLSocketInputRecord.readHeader(SSLSocketInputRecord.java:478)
> at java.base/sun.security.ssl.SSLSocketInputRecord.bytesInCompletePacket(SSLSocketInputRecord.java:70)
> at java.base/sun.security.ssl.SSLSocketImpl.readApplicationRecord(SSLSocketImpl.java:1455)
> at java.base/sun.security.ssl.SSLSocketImpl$AppInputStream.read(SSLSocketImpl.java:1066)
> at okio.InputStreamSource.read(JvmOkio.kt:94)
> at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:125)
> ... 26 common frames omitted
> {code}
> Since I thought it was a performance issue, I doubled each node's resources, but
> the error still occurs.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)