[ 
https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aniket Mokashi reassigned PIG-3047:
-----------------------------------

    Assignee: Aniket Mokashi  (was: Ido Hadanny)

> Check the size of a relation before adding it to distributed cache in 
> Replicated join
> -------------------------------------------------------------------------------------
>
>                 Key: PIG-3047
>                 URL: https://issues.apache.org/jira/browse/PIG-3047
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Julien Le Dem
>            Assignee: Aniket Mokashi
>             Fix For: 0.12.1
>
>         Attachments: fix_pig_3047_1.diff, fix_pig_3047.diff, 
> PIG-3047-2.patch, PIG-3047-3.patch
>
>
> Right now, if someone mistakenly puts the large relation last in a 
> replicated join, Pig copies a huge file into the distributed cache, and it 
> takes a long time before the job eventually fails. It would be better to 
> check, before copying, that the relation is of a reasonable size 
> (for example, under 1 GB?).
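
The check proposed in the description could look roughly like the sketch below. It is not taken from the attached patches. It uses the local filesystem (java.nio.file) as a stand-in for HDFS, and the class name, method names, and the 1 GB default are all hypothetical, chosen only to illustrate failing fast before any copy starts.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.stream.Stream;

public class ReplicatedSizeCheck {
    // Hypothetical limit, taken from the "<1 GB ?" suggestion in the issue.
    public static final long MAX_REPLICATED_BYTES = 1L << 30;

    /** Sums the sizes of all regular files under root (or root itself if it is a file). */
    public static long totalBytes(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths.filter(Files::isRegularFile)
                    .mapToLong(p -> {
                        try {
                            return Files.size(p);
                        } catch (IOException e) {
                            throw new UncheckedIOException(e);
                        }
                    })
                    .sum();
        }
    }

    /** Throws before any copy happens if the relation exceeds maxBytes. */
    public static void checkReplicatedSize(Path relation, long maxBytes) throws IOException {
        long size = totalBytes(relation);
        if (size > maxBytes) {
            throw new IllegalStateException("Replicated relation " + relation
                    + " is " + size + " bytes, which exceeds the limit of " + maxBytes
                    + " bytes; is the large relation listed last in the join?");
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a small fake "relation" directory with one 2 KB part file.
        Path dir = Files.createTempDirectory("relation");
        Files.write(dir.resolve("part-00000"), new byte[2048]);

        checkReplicatedSize(dir, 4096); // within the limit, returns normally
        try {
            checkReplicatedSize(dir, 1024); // over the limit, rejected up front
        } catch (IllegalStateException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

The point of the design is that the size is computed from file metadata only, so the failure surfaces in seconds instead of after a long copy into the distributed cache.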



--
This message was sent by Atlassian JIRA
(v6.1#6144)
