[ 
https://issues.apache.org/jira/browse/SPARK-15142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16099923#comment-16099923
 ] 

Stavros Kontopoulos edited comment on SPARK-15142 at 7/25/17 12:05 PM:
-----------------------------------------------------------------------

[~devaraj.k] Great, so one question to clarify things, if the dispatcher is 
running and then the master is restarted,
a DRIVER_ABORTED is fired and its handled properly for the newly submitted 
drivers? 
In your PR(https://github.com/apache/spark/pull/13143/files) I see you throw 
the exception only when you start the dispatcher and then things fail. This is 
code used by the dispacther as well so if you restart the dispacther and there 
is no master it will fail correct? And that means there is no way to detect the 
new master the dispatcher must be restarted when the new master is up.


was (Author: skonto):
[~devaraj.k] Great, so one question to clarify things, if the dispatcher is 
running and then the master is restarted,
a DRIVER_ABORTED is fired and its handled properly for the newly submitted 
drivers? In that PR you mention it says: "I think we need to throw the 
exception and shutdown the dispatcher." and so later some other PRs like 
(https://github.com/apache/spark/pull/13072) handle the shutdown process.
In your PR(https://github.com/apache/spark/pull/13143/files) I see you throw 
the exception only when you start the dispatcher and then things fail. This is 
code used by the dispacther as well so if you restart the dispacther and there 
is no master it will fail correct? And that means there is no way to detect the 
new master the dispatcher must be restarted when the new master is up.

> Spark Mesos dispatcher becomes unusable when the Mesos master restarts
> ----------------------------------------------------------------------
>
>                 Key: SPARK-15142
>                 URL: https://issues.apache.org/jira/browse/SPARK-15142
>             Project: Spark
>          Issue Type: Bug
>          Components: Deploy, Mesos
>            Reporter: Devaraj K
>            Priority: Minor
>         Attachments: 
> spark-devaraj-org.apache.spark.deploy.mesos.MesosClusterDispatcher-1-stobdtserver5.out
>
>
> While Spark Mesos dispatcher running if the Mesos master gets restarted then 
> Spark Mesos dispatcher will keep running and queues up all the submitted 
> applications and will not launch them.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to