[ https://issues.apache.org/jira/browse/PIG-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ido Hadanny resolved PIG-2519.
------------------------------
Resolution: Duplicate
Assignee: Ido Hadanny
Duplicate of PIG-3047.
> Introduce a mechanism to notify if a larger relation is given for replicated
> join
> ----------------------------------------------------------------------------------
>
> Key: PIG-2519
> URL: https://issues.apache.org/jira/browse/PIG-2519
> Project: Pig
> Issue Type: Improvement
> Affects Versions: 0.9.1
> Reporter: Vivek Padmanabhan
> Assignee: Ido Hadanny
>
> In a replicated join, if a huge relation is mistakenly placed on the right
> side, the job may start filling up the disks and eventually impact the
> overall cluster.
> Furthermore, the documentation for replicated joins says:
> "The small relations must be small enough to fit into main memory; if they
> don't, the process fails and an error is generated."
> http://pig.apache.org/docs/r0.8.1/piglatin_ref1.html#Replicated+Joins
> It would be nice to have a mechanism to fail fast in this sort of scenario,
> or at least to emit a warning message.
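
The fail-fast check requested above could look roughly like the sketch below. It is only an illustration and not how Pig implements anything: the function names, the exception class, and the 100 MB threshold are all hypothetical, and it reads local file sizes where a real check would query HDFS. On-disk size is also only a rough proxy for the memory a relation needs once loaded.

```python
import os

# Hypothetical limit chosen for illustration; not a real Pig setting.
MAX_REPLICATED_BYTES = 100 * 1024 * 1024


class ReplicatedInputTooLarge(Exception):
    """Raised when a replicated-join input exceeds the size limit."""


def total_size(paths):
    """Sum the on-disk sizes of the given files, in bytes."""
    return sum(os.path.getsize(p) for p in paths)


def check_replicated_input(paths, max_bytes=MAX_REPLICATED_BYTES):
    """Fail fast if the files backing the replicated relation are too large.

    Returns the total size in bytes when the input is within the limit.
    """
    size = total_size(paths)
    if size > max_bytes:
        raise ReplicatedInputTooLarge(
            f"replicated input is {size} bytes, limit is {max_bytes} bytes"
        )
    return size
```

Running a check like this before job submission would reject an oversized right-side relation immediately, instead of letting the job fill the disks first.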