Hi,
Spark standalone mode does not use or rely on ZooKeeper by default. The
Spark master and workers communicate directly with each other without using
ZooKeeper. However, it appears that in your case you are relying on
ZooKeeper to provide high availability for your standalone cluster. By
I am using spark-3.4.1 I have a setup with three ZooKeeper servers, Spark
master shuts down when a Zookeeper instance is down a new master is elected
as leader and the cluster is up. But the original master that was down
never comes up. can you please help me with this issue?
Stackoverflow link:-
Looking at Master.scala, I don't see code that would bring master back up
automatically.
Probably you can implement monitoring tool so that you get some alert when
master goes down.
e.g.
http://stackoverflow.com/questions/12896998/how-to-set-up-alerts-on-ganglia
More experienced users may have
Hi Ted,
Thanks for the pointers. I had a three node zookeeper setup . Now the
master alone dies when a zookeeper instance is down and a new master is
elected as leader and the cluster is up.
But the master that was down , never comes up.
Is this the expected ? Is there a way to get alert when a
Please see some blog w.r.t. the number of nodes in the quorum:
http://stackoverflow.com/questions/13022244/zookeeper-reliability-three-versus-five-nodes
http://www.ibm.com/developerworks/library/bd-zookeeper/
the paragraph starting with 'A quorum is represented by a strict majority
of nodes'
I am using zookeeper for providing HA for spark cluster. We have two nodes
zookeeper cluster.
When one of the zookeeper dies then the entire spark cluster goes down .
Is this expected behaviour ?
Am I missing something in config ?
Spark version - 1.6.1.
Zookeeper version - 3.4.6
//