[ https://issues.apache.org/jira/browse/PIG-3047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796809#comment-13796809 ]
Ran Yuchtman commented on PIG-3047: ----------------------------------- we only implemented option 3 in Prashant suggestion (with deafult of 1G), but it should be pretty easy to implement the rest > Check the size of a relation before adding it to distributed cache in > Replicated join > ------------------------------------------------------------------------------------- > > Key: PIG-3047 > URL: https://issues.apache.org/jira/browse/PIG-3047 > Project: Pig > Issue Type: Improvement > Reporter: Julien Le Dem > Assignee: Ido Hadanny > Fix For: 0.12.1 > > Attachments: fix_pig_3047.diff > > > Right now if someone makes a mistake and put the large relation last, Pig > will copy a huge file into distributed cache and it will take a long time > before the job eventually fails. It would be better to check before copying > the relation that it is of reasonable size. > <1 GB ? -- This message was sent by Atlassian JIRA (v6.1#6144)