[ 
https://issues.apache.org/jira/browse/PIG-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ido Hadanny resolved PIG-2519.
------------------------------

    Resolution: Duplicate
      Assignee: Ido Hadanny

dup of PIG-3047

> Introduce a mechanism to notify if a larger relation is given for replicated 
> join 
> ----------------------------------------------------------------------------------
>
>                 Key: PIG-2519
>                 URL: https://issues.apache.org/jira/browse/PIG-2519
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.9.1
>            Reporter: Vivek Padmanabhan
>            Assignee: Ido Hadanny
>
> In a replicated join if a huge relation is mentioned wrongly on the right 
> side, the job may start filling up the disks and eventually impact the 
> overall cluster.
> Furthermore, the document for replicated join says,
> "The small relations must be small enough to fit into main memory; if they 
> don't, the process fails and an error is generated."
> http://pig.apache.org/docs/r0.8.1/piglatin_ref1.html#Replicated+Joins
> It would be nice to have a mechanism to fail fast this sort of scenarios or 
> may be have some warning messages to notify this. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to