[
https://issues.apache.org/jira/browse/HADOOP-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Runping Qi updated HADOOP-2052:
-------------------------------
Description:
When the mappers of distcp finish, the status page in the web gui reports the
data copied.
However, the reported number is far away from the real number, which is very
misleading.
For example, a particular mapper task_200710131713_0001_m_000000_0 reported:
Finished. Bytes copied: 4.3g
However, it does not say which file.
I thought it was for part-00000. But the file size of part-00000
is about 9GB.
It will be much clearer if the status report say something like:
Finished copy file-xxxx: 4.3g
That way, I can easily check whether the size is correct.
was:
When the mappers of distcp finish, the status page in the web gui reports the
data copied.
However, the reported number is far away from the real number, which is very
misleading.
For example, a particular mapper reported:
Finished. Bytes copied: 4.3g
However, the actual file size is about 9GB.
> distcp mapper's status report misleading
> ----------------------------------------
>
> Key: HADOOP-2052
> URL: https://issues.apache.org/jira/browse/HADOOP-2052
> Project: Hadoop
> Issue Type: Bug
> Components: mapred
> Reporter: Runping Qi
>
> When the mappers of distcp finish, the status page in the web gui reports the
> data copied.
> However, the reported number is far away from the real number, which is very
> misleading.
> For example, a particular mapper task_200710131713_0001_m_000000_0 reported:
> Finished. Bytes copied: 4.3g
> However, it does not say which file.
> I thought it was for part-00000. But the file size of part-00000
> is about 9GB.
> It will be much clearer if the status report say something like:
> Finished copy file-xxxx: 4.3g
> That way, I can easily check whether the size is correct.
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.