[ 
https://issues.apache.org/jira/browse/SPARK-6924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated SPARK-6924:
-----------------------------
    Priority: Minor  (was: Major)

> driver hangs when net is broken
> -------------------------------
>
>                 Key: SPARK-6924
>                 URL: https://issues.apache.org/jira/browse/SPARK-6924
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core
>            Reporter: xukun
>            Priority: Minor
>
> In yarn-client mode, client is deployed out side of cluster. When the net 
> between client and cluster is broken, driver lost all executors. In normal 
> situation, client returns and app fails. Actually, the driver hangs, user do 
> not know  whether app is ok. So we should let driver return not hang.
> The solution: in HeartbeatReceiver thread, check whether some executor send 
> heartbeat to dirver at the fixed rate. If no execuor send heartbeats to 
> driver, close SparkContext. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to