Olivier Tavard created ZOOKEEPER-4722: -----------------------------------------
Summary: Reproducible SessionTimeoutException for ZK3.5.7 and higher Key: ZOOKEEPER-4722 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-4722 Project: ZooKeeper Issue Type: Bug Affects Versions: 3.5.7 Reporter: Olivier Tavard Hello Zookeeper team, I am contributing to the Apache ManifoldCF project, which is heavily relying on Zookeeper, and I encountered a reproducible error for which we – the ManifoldCF team - need your help: As part of its output connectors regular updates (ManifoldCF is a data crawler that can send analysed documents to designated destination applications), we are upgrading the Apache Solr output connector so that it is compatible with Solr 9.2. This version of Solr uses Apache Zookeeper 3.8.0, but when using the same Zookeeper version on the Apache ManifoldCF side, we experienced reproducible tests failures on the Solr Output Connector of ManifoldCF when building the project. We were able to pass tests with ZK up to version 3.5.6 included, but all of the more recent versions fail [4]. To ease your evaluation of the problem, we have prepared a fork of Apache ManifoldCF where you can work and reproduce the issue [1]. The branch to be used is CONNECTORS-1740, and the build procedure is as follows (the issue occurs during the testings phase): git checkout CONNECTORS-1740 Now, in order to build with ant (exclusively with Java 11 otherwise it will not work) First, Fetch the dependencies : ant clean make-core-deps Next, start the build itself: ant clean build You will get a success message for both commands. Then start the MCF tests (this is what will trigger the error [4]): ant run-tests-framework Thanks in advance for your help. Best regards, Olivier [1] https://gitlab.datafari.com/sandboxespublic/manifoldcf You can find here the email thread about this problem on the Apache ManifoldCF developer Mailing List [2], as well as the jira issue dedicated to it [3]. [2] https://lists.apache.org/thread/3zx7c5nm23j6wfb0okf5xb01t9jb6lph [3] https://issues.apache.org/jira/browse/CONNECTORS-1740 [4] Error logs we get: [junit] org.apache.zookeeper.ClientCnxn$SessionTimeoutException: Client session timed out, have not heard from server in 2007ms for session id 0x0 [junit] at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1250) [junit] [reader] INFO org.apache.zookeeper.ZooKeeper - Session: 0x0 closed [junit] [reader-EventThread] INFO org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x0 [junit] [reader] INFO org.apache.zookeeper.ZooKeeper - Initiating client connection, connectString=localhost:8348 sessionTimeout=2000 watcher=org.apache.manifoldcf.core.lockmanager.ZooKeeperConnection$ZooKeeperWatcher@45c66a27 [junit] [reader] INFO org.apache.zookeeper.ClientCnxnSocket - jute.maxbuffer value is 1048575 Bytes [junit] [reader] INFO org.apache.zookeeper.ClientCnxn - zookeeper.request.timeout value is 0. feature enabled=false [junit] [reader-SendThread(localhost:8348)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server localhost/127.0.0.1:8348. [junit] [reader-SendThread(localhost:8348)] INFO org.apache.zookeeper.ClientCnxn - SASL config status: Will not attempt to authenticate using SASL (unknown error) [junit] [reader-SendThread(localhost:8348)] WARN org.apache.zookeeper.ClientCnxn - Client session timed out, have not heard from server in 2008ms for session id 0x0 [junit] [reader-SendThread(localhost:8348)] WARN org.apache.zookeeper.ClientCnxn - An exception was thrown while closing send thread for session 0x0. [junit] org.apache.zookeeper.ClientCnxn$SessionTimeoutException: Client session timed out, have not heard from server in 2008ms for session id 0x0 -- This message was sent by Atlassian Jira (v8.20.10#820010)