[
https://issues.apache.org/jira/browse/SPARK-7941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564396#comment-14564396
]
Sean Owen commented on SPARK-7941:
----------------------------------
I am not an expert on this bit of YARN, but, this looks like app data from the
hadoop user, rather than you or yarn or a spark user. Is it Spark-related? Are
these containers still running actually? Container data may stick around for a
retry too in some cases. I also thought YARN would eventually clean this up
regardless of what happened in the container.
> Cache Cleanup Failure when job is killed by Spark
> --------------------------------------------------
>
> Key: SPARK-7941
> URL: https://issues.apache.org/jira/browse/SPARK-7941
> Project: Spark
> Issue Type: Bug
> Components: PySpark, YARN
> Affects Versions: 1.3.1
> Reporter: Cory Nguyen
> Attachments: screenshot-1.png
>
>
> Problem/Bug:
> If a job is running and Spark kills the job intentionally, the cache files
> remains on the local/worker nodes and are not cleaned up properly. Over time
> the old cache builds up and causes "No Space Left on Device" error.
> The cache is cleaned up properly when the job succeeds. I have not verified
> if the cached remains when the user intentionally kills the job.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]