[
https://issues.apache.org/jira/browse/TEZ-3486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15616535#comment-15616535
]
Rohini Palaniswamy commented on TEZ-3486:
-----------------------------------------
First2OldReducer and First2NewReducer - They seem to be producing output only
when the combiner is called for the first two times. Please change them to
produce values based on the input and not how many times the combiner is
invoked.
{code}
private static class LimitOldReducer extends OldReducer {
protected void reduce(Text key, Iterable<IntWritable> values,
Context context) throws IOException, InterruptedException {
int i = 0;
// Limits to two records
for (IntWritable value : values) {
if (i++ < 2) {
context.write(new Text(key.toString()), value);
} else {
break;
}
}
}
}
{code}
> COMBINE_OUTPUT_RECORDS/COMBINE_INPUT_RECORDS are not correct
> ------------------------------------------------------------
>
> Key: TEZ-3486
> URL: https://issues.apache.org/jira/browse/TEZ-3486
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Eric Badger
> Assignee: Eric Badger
> Attachments: TEZ-3486.001.patch, TEZ-3486.002.patch
>
>
> They appear to be swapped, though that is not quite the case. You can
> consistently see {{COMBINE_INPUT_RECORDS}} < {{COMBINE_OUTPUT_RECORDS}},
> which shouldn't be possible.
> This issue relates to TEZ-1344, which was closed due to not being
> reproducible, but it actually is fairly easily reproducible if you run the
> same job on Tez and on MR. For whatever reason, I cannot reopen that Jira, so
> I'm creating this one.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)