[jira] [Commented] (SPARK-7941) Cache Cleanup Failure when job is killed by Spark

Sean Owen (JIRA) Fri, 29 May 2015 01:17:27 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-7941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14564396#comment-14564396
 ]


Sean Owen commented on SPARK-7941:
----------------------------------

I am not an expert on this bit of YARN, but, this looks like app data from the 
hadoop user, rather than you or yarn or a spark user. Is it Spark-related? Are 
these containers still running actually? Container data may stick around for a 
retry too in some cases. I also thought YARN would eventually clean this up 
regardless of what happened in the container. 

> Cache Cleanup Failure when job is killed by Spark 
> --------------------------------------------------
>
>                 Key: SPARK-7941
>                 URL: https://issues.apache.org/jira/browse/SPARK-7941
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark, YARN
>    Affects Versions: 1.3.1
>            Reporter: Cory Nguyen
>         Attachments: screenshot-1.png
>
>
> Problem/Bug:
> If a job is running and Spark kills the job intentionally, the cache files 
> remains on the local/worker nodes and are not cleaned up properly. Over time 
> the old cache builds up and causes "No Space Left on Device" error. 
> The cache is cleaned up properly when the job succeeds. I have not verified 
> if the cached remains when the user intentionally kills the job. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SPARK-7941) Cache Cleanup Failure when job is killed by Spark

Reply via email to