https://issues.apache.org/jira/browse/PIG-3059
I wanted to make sure people saw this JIRA, as I think it will dramatically improve Pig. Discussion of this issue is available here: http://www.quora.com/Big-Data/In-Big-Data-ETL-how-many-records-are-an-acceptable-loss Russell Jurney http://datasyndrome.com
