Introduce a mechanism to notify if a larger relation is given for replicated
join
----------------------------------------------------------------------------------
Key: PIG-2519
URL: https://issues.apache.org/jira/browse/PIG-2519
Project: Pig
Issue Type: Improvement
Affects Versions: 0.9.1
Reporter: Vivek Padmanabhan
In a replicated join if a huge relation is mentioned wrongly on the right side,
the job may start filling up the disks and eventually impact the overall
cluster.
Furthermore, the document for replicated join says,
"The small relations must be small enough to fit into main memory; if they
don't, the process fails and an error is generated."
http://pig.apache.org/docs/r0.8.1/piglatin_ref1.html#Replicated+Joins
It would be nice to have a mechanism to fail fast this sort of scenarios or may
be have some warning messages to notify this.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira