[ https://issues.apache.org/jira/browse/ZOOKEEPER-4410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17437112#comment-17437112 ]
TangHq commented on ZOOKEEPER-4410: ----------------------------------- os: ubuntu 20.04 zookeeper-server: 3.7.0 single client: python kazoo zoo.cfg: tickTime=10000 initLimit=10 syncLimit=10 dataDir=/data/zk_data clientPort=2181 maxClientCnxns=160000 Client TImeout: 3600s > Unable to read additional data from client, it probably closed the socket > ------------------------------------------------------------------------- > > Key: ZOOKEEPER-4410 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4410 > Project: ZooKeeper > Issue Type: Bug > Components: server > Affects Versions: 3.7 > Environment: os: ubuntu 20.04 > zookeeper-server: 3.7.0 single > client: python kazoo > > zoo.cfg: > tickTime=10000 > initLimit=10 > syncLimit=10 > dataDir=/data/zk_data > clientPort=2181 > maxClientCnxns=160000 > > Client TImeout: 3600s > > Reporter: TangHq > Priority: Major > > Hi, > > When zookeeper is running, there will be a warning message in the log and > fter zookeeper has been running for a period of time, the connection between > the client and the server will be lost after about one or two days. > > There are my log report: > 2021-10-29 09:27:32,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - > Expiring session 0x10013a65a780163, timeout of 20000ms exceeded > 2021-10-29 09:27:47,827 [myid:] - WARN [NIOWorkerThread-84:NIOServerCnxn@371] > - Unexpected exception > EndOfStreamException: Unable to read additional data from client, it probably > closed the socket: address = /10.*.*.*:34658, session = 0x10013a65a780164 > at > org.apache.zookeeper.server.NIOServerCnxn.handleFailedRead(NIOServerCnxn.java:170) > at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:333) > at > org.apache.zookeeper.server.NIOServerCnxnFactory$IOWorkRequest.doWork(NIOServerCnxnFactory.java:508) > at > org.apache.zookeeper.server.WorkerService$ScheduledWorkRequest.run(WorkerService.java:154) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > 2021-10-29 09:28:12,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - > Expiring session 0x10013a65a780164, timeout of 20000ms exceeded > 2021-10-29 09:30:14,272 [myid:] - WARN > [NIOWorkerThread-103:NIOServerCnxn@371] - Unexpected exception > EndOfStreamException: Unable to read additional data from client, it probably > closed the socket: address = /10.*.*.*:53590, session = 0x10013a65a780166 > at > org.apache.zookeeper.server.NIOServerCnxn.handleFailedRead(NIOServerCnxn.java:170) > at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:333) > at > org.apache.zookeeper.server.NIOServerCnxnFactory$IOWorkRequest.doWork(NIOServerCnxnFactory.java:508) > at > org.apache.zookeeper.server.WorkerService$ScheduledWorkRequest.run(WorkerService.java:154) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > 2021-10-29 09:30:42,358 [myid:] - INFO [SessionTracker:ZooKeeperServer@628] - > Expiring session 0x10013a65a780166, timeout of 20000ms exceeded > > I search something related of this warn in internet and increased tickTime, > syncLimit, but it's also no effect. > > And the client error: > WARNING - Connection dropped: outstanding heartbeat ping not received\n' > WARNING - Transition to CONNECTING\n' > INFO - [stdout] : b'4: 2021-10-31 23:23:10,581 - INFO - Zookeeper connection > lost\n' > INFO - [stdout] : b'4: Traceback (most recent call last):\n' > INFO - [stdout] : b'4: File "/home/kazoo/cluster.py", line 376, in <module>\n' > INFO - [stdout] : b'4: test_cluster_config = sys.argv[1]\n' > INFO - [stdout] : b'4: File "/home/kazoo/cluster.py", line 219, in > update_cumulus\n' > INFO - [stdout] : b'4: if label_not_found:\n' > INFO - [stdout] : b'4: File "/home/kazoo/zkstate.py", line 25, in > processing\n' > INFO - [stdout] : b'4: return > self._zk.exists(self._path+"/"+self._name+"processing")\n' > INFO - [stdout] : b'4: File > "/usr/local/lib/python3.6/site-packages/kazoo/client.py", line 1123, in > exists\n' > INFO - [stdout] : b'4: return self.exists_async(path, watch=watch).get()\n' > INFO - [stdout] : b'4: File > "/usr/local/lib/python3.6/site-packages/kazoo/handlers/utils.py", line 75, in > get\n' > INFO - [stdout] : b'4: raise self._exception\n' > INFO - [stdout] : b'4: kazoo.exceptions.ConnectionLoss\n' > -- This message was sent by Atlassian Jira (v8.3.4#803005)