[ 
https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796809#comment-13796809
 ] 

Ran Yuchtman commented on PIG-3047:
-----------------------------------

we only implemented option 3 in Prashant suggestion (with deafult of 1G), but 
it should be pretty easy to implement the rest

> Check the size of a relation before adding it to distributed cache in 
> Replicated join
> -------------------------------------------------------------------------------------
>
>                 Key: PIG-3047
>                 URL: https://issues.apache.org/jira/browse/PIG-3047
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Julien Le Dem
>            Assignee: Ido Hadanny
>             Fix For: 0.12.1
>
>         Attachments: fix_pig_3047.diff
>
>
> Right now if someone makes a mistake and put the large relation last, Pig 
> will copy a huge file into distributed cache and it will take a long time 
> before the job eventually fails. It would be better to check before copying 
> the relation that it is of reasonable size.
> <1 GB ?



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to