[
https://issues.apache.org/jira/browse/SPARK-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14152552#comment-14152552
]
Andrew Ash commented on SPARK-1860:
-----------------------------------
Cleanup on executor shutdown is part of the solution (and should be done IMO)
but not all of it.
Particularly it won't cover when an executor dies from an OOM or a kill -9 or
any other unclean shutdown. The perfect solution would do the event-based
cleanup self on executor shutdown, and also a periodic cleaner to get rid of
directories that were shutdown uncleanly.
> Standalone Worker cleanup should not clean up running executors
> ---------------------------------------------------------------
>
> Key: SPARK-1860
> URL: https://issues.apache.org/jira/browse/SPARK-1860
> Project: Spark
> Issue Type: Bug
> Components: Deploy
> Affects Versions: 1.0.0
> Reporter: Aaron Davidson
> Priority: Blocker
>
> The default values of the standalone worker cleanup code cleanup all
> application data every 7 days. This includes jars that were added to any
> executors that happen to be running for longer than 7 days, hitting streaming
> jobs especially hard.
> Executor's log/data folders should not be cleaned up if they're still
> running. Until then, this behavior should not be enabled by default.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]