[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779573#action_12779573
]
Thejas M Nair commented on PIG-1062:
Instead of adding the num-rows information as a last
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779054#action_12779054
]
Thejas M Nair commented on PIG-1062:
{quote}
In SampleLoader.java
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779126#action_12779126
]
Thejas M Nair commented on PIG-1062:
Yes, I think SampleLoader.getNext() can be moved to
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12778654#action_12778654
]
Pradeep Kamath commented on PIG-1062:
Review comments:
In SampleLoader.java
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12778666#action_12778666
]
Arun C Murthy commented on PIG-1062:
bq. It looks like ReduceContext has a getCounter()
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1294#action_1294
]
Hadoop QA commented on PIG-1062:
-1 overall. Here are the results of testing the latest
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12777352#action_12777352
]
Hadoop QA commented on PIG-1062:
-1 overall. Here are the results of testing the latest
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776526#action_12776526
]
Thejas M Nair commented on PIG-1062:
Proposal for sampling in RandomSampleLoader (as well
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776035#action_12776035
]
Thejas M Nair commented on PIG-1062:
bq. You can get the same info from the counters
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12776052#action_12776052
]
Dmitriy V. Ryaboy commented on PIG-1062:
It looks like ReduceContext has a
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772565#action_12772565
]
Thejas M Nair commented on PIG-1062:
WeightedRangePartitioner.setConf use of fileSize()
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772623#action_12772623
]
Thejas M Nair commented on PIG-1062:
Even after the interface changes, pig can compute
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772704#action_12772704
]
Thejas M Nair commented on PIG-1062:
As indicated in previous comment, I am planning to
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772797#action_12772797
]
Dmitriy V. Ryaboy commented on PIG-1062:
The sampler (in this design) reads all the
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772807#action_12772807
]
Dmitriy V. Ryaboy commented on PIG-1062:
Thejas:
bq. sending a special tuple with
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772017#action_12772017
]
Dmitriy V. Ryaboy commented on PIG-1062:
I have ResourceStats hooked up to
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772197#action_12772197
]
Thejas M Nair commented on PIG-1062:
Dmitriy,
I had overlooked the fact that input size
[
https://issues.apache.org/jira/browse/PIG-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771615#action_12771615
]
Thejas M Nair commented on PIG-1062:
Skew-join uses the total number of input tuples, in
18 matches