[
https://issues.apache.org/jira/browse/PIG-1838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12989481#comment-12989481
]
Allen Wittenauer commented on PIG-1838:
---------------------------------------
I have a few thoughts on a better/programmatic way for Pig to be better behaved
without depending on users doing the right thing. One or more of these would
probably work:
a) Redefine java.io.tmpdir itself after it gets the Hadoop property files loaded
b) In the pig wrapper script, parse mapred-site.xml and pull out the mapred tmp
space
c) Override Java's createTempFile method to use Hadoop's tmp
location/$TEMPDIR/$TMPDIR/$TEMP/some other value
d) Change the jar assembly such that it goes into a
create->submit->delete->repeat pattern. (From a casual glance, it appears to
create all the jars at once rather than just when needed.)
> On a large farm, some pigs die of /tmp starvation
> -------------------------------------------------
>
> Key: PIG-1838
> URL: https://issues.apache.org/jira/browse/PIG-1838
> Project: Pig
> Issue Type: Wish
> Components: impl
> Affects Versions: 0.8.0
> Reporter: Allen Wittenauer
>
> We're starting to issues where interactive/command line pig users blow up due
> to so many large jar creations in /tmp. (In other words, pig execution prior
> to the java.io.tmpdir fix that Hadoop makes can kick in.) Pig should
> probably not depend upon users being savvy enough to override java.io.tmpdir
> on their own in these situations and/or a better steward of the space it does
> use.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira