zengqiuyang created SPARK-9629:
----------------------------------
Summary: Client session timed out, have not heard from server in
Key: SPARK-9629
URL: https://issues.apache.org/jira/browse/SPARK-9629
Project: Spark
Issue Type: Bug
Components: Deploy
Affects Versions: 1.4.1, 1.4.0
Environment: spark1.4.1 ./make-distribution.sh --tgz
-Dhadoop.version=2.5.2 -Dyarn.version=2.5.2 -Phive -Phive-thriftserver -Pyarn
zookeeper-3.4.6.tar.gz
Reporter: zengqiuyang
Priority: Critical
the spark HA running every few days , then " Client session timed out"
appear。
show reconnect but not do it, and master shutting down.
logs:
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Client session timed out, have
not heard from server in 37753ms for sessionid 0x34ee39684b70005, closing
socket connection and attempting reconnect
15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: SUSPENDED
15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no
ConnectionStateListeners registered.
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Opening socket connection to
server h5/192.168.0.18:2181. Will not attempt to authenticate using SASL
(unknown error)
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Socket connection established to
h5/192.168.0.18:2181, initiating session
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Session establishment complete on
server h5/192.168.0.18:2181, sessionid = 0x34ee39684b70005, negotiated timeout
= 40000
15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: RECONNECTED
15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no
ConnectionStateListeners registered.
15/08/05 05:32:58 INFO zookeeper.ClientCnxn: Client session timed out, have not
heard from server in 37753ms for sessionid 0x34ee39684b70006, closing socket
connection and attempting reconnect
15/08/05 05:32:58 INFO state.ConnectionStateManager: State change: SUSPENDED
15/08/05 05:32:58 INFO master.ZooKeeperLeaderElectionAgent: We have lost
leadership
15/08/05 05:32:58 ERROR master.Master: Leadership has been revoked -- master
shutting down.
15/08/05 05:32:58 INFO util.Utils: Shutdown hook called
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]