[ 
https://issues.apache.org/jira/browse/SPARK-9438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15141835#comment-15141835
 ] 

Thomas Demoor commented on SPARK-9438:
--------------------------------------

We have witnessed this as well in 1.3. Losing the ZK leader takes down the 
active spark master. 



> restarting leader zookeeper causes spark master to die when the spark master 
> election is assigned to zookeeper
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-9438
>                 URL: https://issues.apache.org/jira/browse/SPARK-9438
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>    Affects Versions: 1.2.0
>         Environment: Saprk 1.2.0 and Zookeeper version: 3.4.6-1569965
>            Reporter: Amir Rad
>
> When Spark Master Election is assigned to Zookeeper, restarting the leader 
> Zookeeper causes the master spark to die. 
> Steps to reproduce:
> create a cluster of 3 spark nodes. 
> set Spark-env to:
> SPARK_LOCAL_DIRS="/home/sparkcde/data_spark/data"
> SPARK_MASTER_OPTS="-Dspark.deploy.spreadOut=false"
> SPARK_WORKER_DIR="/home/sparkcde/data_spark/worker"
> SPARK_WORKER_OPTS="-Dspark.worker.cleanup.enabled=true"
> SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER 
> -Dspark.deploy.zookeeper.url=s1:2181,s2:2181,s3:2181"
> Identify the spark master
> identify the zookeeper leader. 
> Stop zookeeper leader
> check spark master: It is dead
> start zookeeper leader
> check spark master: still dead
> If you continue the same pattern of stopping and starting zookeeper leader, 
> eventually you will lose the whole spark cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to