PERFORMANCE: Sampler for order bys does not produce a good distribution -----------------------------------------------------------------------
Key: PIG-545 URL: https://issues.apache.org/jira/browse/PIG-545 Project: Pig Issue Type: Bug Components: impl Affects Versions: types_branch Reporter: Alan Gates Fix For: types_branch In running tests on actual data, I've noticed that the final reduce of an order by has skewed partitions. Some reduces finish in a few seconds while some run for 20 minutes. Getting a better distribution should lead to much better performance for order by. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.