[ 
https://issues.apache.org/jira/browse/SPARK-3110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Shreedharan updated SPARK-3110:
------------------------------------

    Issue Type: Sub-task  (was: Bug)
        Parent: SPARK-3129

> Add a "ha" mode in YARN mode to keep executors in between restarts
> ------------------------------------------------------------------
>
>                 Key: SPARK-3110
>                 URL: https://issues.apache.org/jira/browse/SPARK-3110
>             Project: Spark
>          Issue Type: Sub-task
>            Reporter: Hari Shreedharan
>
> The idea is for long running processes like streaming, you'd want the AM to 
> come back up and reuse the same executors, so it can get the blocks from the 
> memory of the executors because many streaming systems like Flume cannot 
> really replay the data once it has been taken out. Even for others which can, 
> the time period before data "expires" can mean some data could be lost. This 
> is the first step in a series of patches for this one. The next is to get the 
> AM to find the executors. My current plan is to use HDFS to keep track of 
> where the executors are running and then communicate to them via Akka, to get 
> a block list.
> I plan to expose this via SparkSubmit as the last step once we have all of 
> the other pieces in place.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to