[
https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15304911#comment-15304911
]
shane knapp commented on SPARK-15619:
-------------------------------------
next time we have a maintenance, i will wipe /tmp completely so that we can at
least try and see what's creating what... right now it's such a mess that it's
hard to attribute anything to anything.
i did watch as a spark build (spark-master-test-maven-hadoop-2.7, IIRC) dump a
bunch of the liblz4-java3891256912513794605.so files in /tmp this morning on
worker-08 (the number string changes for each file).
and, so far today, we've had 2628 of these files left in /tmp:
[root@amp-jenkins-worker-08 tmp]# ls -lt | grep liblz4 |grep "May 27" | wc -l
2628
i'm not worried about us running out of disk, and this is something i can
manage on the system-level, but it'd still be nice to have well behaved tests.
:)
> spark builds filling up /tmp
> ----------------------------
>
> Key: SPARK-15619
> URL: https://issues.apache.org/jira/browse/SPARK-15619
> Project: Spark
> Issue Type: Bug
> Components: Build
> Reporter: shane knapp
> Priority: Minor
>
> spark builds aren't cleaning up /tmp after they run... it's hard to pinpoint
> EXACTLY what is left there by the spark builds (as other builds are also
> guilty of doing this), but a quick perusal of the /tmp directory during some
> spark builds show that there are myriad empty directories being created and a
> massive pile of shared object libraries being dumped there.
> $ for x in $(cat jenkins_workers.txt ); do echo $x; ssh $x "ls -l /tmp/*.so |
> wc -l"; done
> amp-jenkins-worker-01
> 0
> ls: cannot access /tmp/*.so: No such file or directory
> amp-jenkins-worker-02
> 22312
> amp-jenkins-worker-03
> 39673
> amp-jenkins-worker-04
> 39548
> amp-jenkins-worker-05
> 39577
> amp-jenkins-worker-06
> 39299
> amp-jenkins-worker-07
> 39315
> amp-jenkins-worker-08
> 38529
> to help combat this, i set up a cron job on each worker that runs tmpwatch
> during system downtime on sundays to clean up files older than a week.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]