[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2015-03-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4148: Issue Type: Bug (was: Sub-task) Parent: (was: PIG-3446) Tez order-by is often skewed because

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-10-30 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-4148: Fix Version/s: (was: 0.14.0) 0.14.1 Push to 0.14.1 since I cannot find a reproducible

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-09-08 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-4148: --- Attachment: popackage.log generate_sample.py samples_logs.tar.gz

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-09-08 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-4148: --- Attachment: metric_retention.explain I am also attaching the explain output of my job. To summarize, it

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-09-01 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-4148: --- Description: In Tez, FindQuantiles UDF is called with a smaller number of samples than MR resulting in

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-08-31 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-4148: --- Attachment: PIG-4148-1.patch The patch changes the number of samples to parallelism x per-task sample

[jira] [Updated] (PIG-4148) Tez order-by is often skewed because FindQuantiles UDF is called with small number

2014-08-31 Thread Cheolsoo Park (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheolsoo Park updated PIG-4148: --- Attachment: (was: PIG-4148-1.patch) Tez order-by is often skewed because FindQuantiles UDF is