[
https://issues.apache.org/jira/browse/TEZ-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147545#comment-14147545
]
Jeff Zhang edited comment on TEZ-1344 at 9/25/14 8:35 AM:
----------------------------------------------------------
[~apivovarov]
should change this line (WordCount.java)
{code}
DAGStatus dagStatus = dagClient.waitForCompletionWithStatusUpdates(null);
{code}
to
{code}
DAGStatus dagStatus =
dagClient.waitForCompletionWithStatusUpdates(EnumSet.of(StatusGetOpts.GET_COUNTERS));
{code}
was (Author: zjffdu):
[~apivovarov]
should change this line
{code}
DAGStatus dagStatus = dagClient.waitForCompletionWithStatusUpdates(null);
{code}
to
{code}
DAGStatus dagStatus =
dagClient.waitForCompletionWithStatusUpdates(EnumSet.of(StatusGetOpts.GET_COUNTERS));
{code}
> Combiner counters reported by Tez look wrong
> --------------------------------------------
>
> Key: TEZ-1344
> URL: https://issues.apache.org/jira/browse/TEZ-1344
> Project: Apache Tez
> Issue Type: Bug
> Reporter: Cheolsoo Park
> Priority: Minor
>
> Combiner input/output counters reported by a Tez job seems wrong
> {code}
> org.apache.hadoop.mapreduce.TaskCounter:
> COMBINE_OUTPUT_RECORDS 35,977,263,353
> COMBINE_INPUT_RECORDS 1,000,529,333
> {code}
> As can be seen, combiner output records > input records?!
> The same counters from a MR job looks as follows-
> {code}
> Map-Reduce Framework:
> Combine output records 1,000,316,600
> Combine input records 35,977,049,632
> {code}
> Somehow input and output are swapped?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)