TangHq created ZOOKEEPER-4410: --------------------------------- Summary: Unable to read additional data from client, it probably closed the socket Key: ZOOKEEPER-4410 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4410 Project: ZooKeeper Issue Type: Bug Components: server Affects Versions: 3.7 Environment: os: ubuntu 20.04
zookeeper-server: 3.7.0 single client: python kazoo zoo.cfg: tickTime=10000 initLimit=10 syncLimit=10 dataDir=/data/zk_data clientPort=2181 maxClientCnxns=160000 Client TImeout: 3600s Reporter: TangHq Hi, When zookeeper is running, there will be a warning message in the log and fter zookeeper has been running for a period of time, the connection between the client and the server will be lost after about one or two days. There are my log report: 2021-10-29 09:27:32,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - Expiring session 0x10013a65a780163, timeout of 20000ms exceeded 2021-10-29 09:27:47,827 [myid:] - WARN [NIOWorkerThread-84:NIOServerCnxn@371] - Unexpected exception EndOfStreamException: Unable to read additional data from client, it probably closed the socket: address = /10.*.*.*:34658, session = 0x10013a65a780164 at org.apache.zookeeper.server.NIOServerCnxn.handleFailedRead(NIOServerCnxn.java:170) at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:333) at org.apache.zookeeper.server.NIOServerCnxnFactory$IOWorkRequest.doWork(NIOServerCnxnFactory.java:508) at org.apache.zookeeper.server.WorkerService$ScheduledWorkRequest.run(WorkerService.java:154) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) 2021-10-29 09:28:12,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - Expiring session 0x10013a65a780164, timeout of 20000ms exceeded 2021-10-29 09:30:14,272 [myid:] - WARN [NIOWorkerThread-103:NIOServerCnxn@371] - Unexpected exception EndOfStreamException: Unable to read additional data from client, it probably closed the socket: address = /10.*.*.*:53590, session = 0x10013a65a780166 at org.apache.zookeeper.server.NIOServerCnxn.handleFailedRead(NIOServerCnxn.java:170) at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:333) at org.apache.zookeeper.server.NIOServerCnxnFactory$IOWorkRequest.doWork(NIOServerCnxnFactory.java:508) at org.apache.zookeeper.server.WorkerService$ScheduledWorkRequest.run(WorkerService.java:154) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748) 2021-10-29 09:30:42,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - Expiring session 0x10013a65a780166, timeout of 20000ms exceeded I search something related of this warn in internet and increased tickTime, syncLimit, but it's also no effect. And the client error: WARNING - Connection dropped: outstanding heartbeat ping not received\n' WARNING - Transition to CONNECTING\n' INFO - [stdout] : b'4: 2021-10-31 23:23:10,581 - INFO - Zookeeper connection lost\n' INFO - [stdout] : b'4: Traceback (most recent call last):\n' INFO - [stdout] : b'4: File "/home/kazoo/cluster.py", line 376, in <module>\n' INFO - [stdout] : b'4: test_cluster_config = sys.argv[1]\n' INFO - [stdout] : b'4: File "/home/kazoo/cluster.py", line 219, in update_cumulus\n' INFO - [stdout] : b'4: if label_not_found:\n' INFO - [stdout] : b'4: File "/home/kazoo/zkstate.py", line 25, in processing\n' INFO - [stdout] : b'4: return self._zk.exists(self._path+"/"+self._name+"processing")\n' INFO - [stdout] : b'4: File "/usr/local/lib/python3.6/site-packages/kazoo/client.py", line 1123, in exists\n' INFO - [stdout] : b'4: return self.exists_async(path, watch=watch).get()\n' INFO - [stdout] : b'4: File "/usr/local/lib/python3.6/site-packages/kazoo/handlers/utils.py", line 75, in get\n' INFO - [stdout] : b'4: raise self._exception\n' INFO - [stdout] : b'4: kazoo.exceptions.ConnectionLoss\n' -- This message was sent by Atlassian Jira (v8.3.4#803005)