[ https://issues.apache.org/jira/browse/NIFI-12685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Giovanni resolved NIFI-12685.
-----------------------------
    Resolution: Not A Bug

After further analysis I found a recurring error on the primary node:

2024-01-31 15:14:21,158 ERROR [Load-Balanced Client Thread-3] o.a.n.c.q.c.c.a.n.NioAsyncLoadBalanceClient Unable to connect to tor1nifi1.skl.one:8443 for load balancing

The cause was iptables blocking port 6342 (nifi.cluster.load.balance.port) on every node.
Adding an iptables rule for that port resolved the issue.

> NiFi Cluster - java.net.SocketTimeoutException: timeout
> --------------------------------------------------------
>
>                 Key: NIFI-12685
>                 URL: https://issues.apache.org/jira/browse/NIFI-12685
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.22.0
>            Reporter: Giovanni
>            Priority: Blocker
>         Attachments: image-2024-01-30-10-19-30-651.png
>
>
> Hi,
> I have a 3-node cluster, and every time I try to edit a workflow one of the
> nodes times out and the UI returns the following error:
> !image-2024-01-30-10-19-30-651.png!
>
> At the same time, one of the nodes reports the following in nifi-app.log:
> {code:java}
> 2024-01-30 09:11:28,489 WARN [Replicate Request Thread-496] o.a.n.c.c.h.r.ThreadPoolRequestReplicator
> java.net.SocketTimeoutException: timeout
>     at okio.SocketAsyncTimeout.newTimeoutException(JvmOkio.kt:147)
>     at okio.AsyncTimeout.access$newTimeoutException(AsyncTimeout.kt:158)
>     at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:337)
>     at okio.RealBufferedSource.indexOf(RealBufferedSource.kt:427)
>     at okio.RealBufferedSource.readUtf8LineStrict(RealBufferedSource.kt:320)
>     at okhttp3.internal.http1.HeadersReader.readLine(HeadersReader.kt:29)
>     at okhttp3.internal.http1.Http1ExchangeCodec.readResponseHeaders(Http1ExchangeCodec.kt:180)
>     at okhttp3.internal.connection.Exchange.readResponseHeaders(Exchange.kt:110)
>     at okhttp3.internal.http.CallServerInterceptor.intercept(CallServerInterceptor.kt:93)
>     at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>     at okhttp3.internal.connection.ConnectInterceptor.intercept(ConnectInterceptor.kt:34)
>     at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>     at okhttp3.internal.cache.CacheInterceptor.intercept(CacheInterceptor.kt:95)
>     at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>     at okhttp3.internal.http.BridgeInterceptor.intercept(BridgeInterceptor.kt:83)
>     at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>     at okhttp3.internal.http.RetryAndFollowUpInterceptor.intercept(RetryAndFollowUpInterceptor.kt:76)
>     at okhttp3.internal.http.RealInterceptorChain.proceed(RealInterceptorChain.kt:109)
>     at okhttp3.internal.connection.RealCall.getResponseWithInterceptorChain$okhttp(RealCall.kt:201)
>     at okhttp3.internal.connection.RealCall.execute(RealCall.kt:154)
>     at org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:136)
>     at org.apache.nifi.cluster.coordination.http.replication.okhttp.OkHttpReplicationClient.replicate(OkHttpReplicationClient.java:130)
>     at org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator.replicateRequest(ThreadPoolRequestReplicator.java:645)
>     at org.apache.nifi.cluster.coordination.http.replication.ThreadPoolRequestReplicator$NodeHttpRequest.run(ThreadPoolRequestReplicator.java:869)
>     at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>     at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>     at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>     at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>     at java.base/java.lang.Thread.run(Thread.java:829)
> Caused by: java.net.SocketException: Socket closed
>     at java.base/java.net.SocketInputStream.read(SocketInputStream.java:183)
>     at java.base/java.net.SocketInputStream.read(SocketInputStream.java:140)
>     at java.base/sun.security.ssl.SSLSocketInputRecord.read(SSLSocketInputRecord.java:484)
>     at java.base/sun.security.ssl.SSLSocketInputRecord.readHeader(SSLSocketInputRecord.java:478)
>     at java.base/sun.security.ssl.SSLSocketInputRecord.bytesInCompletePacket(SSLSocketInputRecord.java:70)
>     at java.base/sun.security.ssl.SSLSocketImpl.readApplicationRecord(SSLSocketImpl.java:1455)
>     at java.base/sun.security.ssl.SSLSocketImpl$AppInputStream.read(SSLSocketImpl.java:1066)
>     at okio.InputStreamSource.read(JvmOkio.kt:94)
>     at okio.AsyncTimeout$source$1.read(AsyncTimeout.kt:125)
>     ... 26 common frames omitted
> {code}
> As I thought it was a performance issue, I doubled each node's resources, but
> the error is still occurring.

--
This message was sent by Atlassian Jira (v8.20.10#820010)
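
Since the root cause reported in this issue was a firewall blocking an inter-node port, a quick TCP reachability check run from each node can help confirm or rule out the same problem. The following is only a minimal sketch in Python and not part of NiFi: the helper names `port_reachable` and `check_cluster_ports` are hypothetical, the ports 8443 and 6342 are the values mentioned in this issue (other clusters may be configured differently), and a successful TCP connect says nothing about TLS or NiFi itself being healthy.

```python
import socket


def port_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    A firewall DROP rule typically shows up as a timeout and a REJECT rule
    as a refused connection; both are reported as False here.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers timeouts, refusals, and DNS failures
        return False


def check_cluster_ports(hosts, ports, timeout: float = 3.0) -> dict:
    """Map every (host, port) pair to whether it accepted a TCP connection."""
    return {
        (host, port): port_reachable(host, port, timeout)
        for host in hosts
        for port in ports
    }
```

Run on each node in turn, something like `check_cluster_ports(["tor1nifi1.skl.one"], [8443, 6342])` with the remaining cluster hostnames added to the list would show which node-to-node ports are unreachable from that node. Because firewall rules are per host, the result from one node does not carry over to the others.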