[
https://issues.apache.org/jira/browse/HDFS-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13745373#comment-13745373
]
Bikas Saha commented on HDFS-5098:
----------------------------------
It's good to see that the information is already available for HDFS. However, in
MR we work at the FileSystem abstraction layer, and we use
FileSystem.getAllStatistics() to get the statistics. Hence, we cannot depend on
calling HdfsDataInputStream. It would be great if that information were passed
up to the FileSystem.Statistics layer. I'm guessing that's why the
Statistics object exists.
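As a rough illustration of the request, here is a minimal sketch, assuming a
per-scheme counter object that a file system implementation updates on each
read. The class LocalityStatistics and its methods recordRead,
getLocalBytesRead and getLocalReadFraction are hypothetical names made up for
this example; they are not part of the existing FileSystem.Statistics API.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch only: this is NOT org.apache.hadoop.fs.FileSystem.Statistics.
// It shows the kind of locality counters a client such as MR/Tez could read
// without depending on an HDFS-specific stream class.
public class LocalityStatistics {
    private final String scheme;
    private final AtomicLong bytesRead = new AtomicLong();
    private final AtomicLong localBytesRead = new AtomicLong();

    public LocalityStatistics(String scheme) {
        this.scheme = scheme;
    }

    // Assumed to be called by the file system implementation after each read;
    // 'local' says whether the bytes came from a replica on the same host.
    public void recordRead(long bytes, boolean local) {
        bytesRead.addAndGet(bytes);
        if (local) {
            localBytesRead.addAndGet(bytes);
        }
    }

    public String getScheme() {
        return scheme;
    }

    public long getBytesRead() {
        return bytesRead.get();
    }

    public long getLocalBytesRead() {
        return localBytesRead.get();
    }

    // Fraction of all bytes read that were read locally; 0.0 if nothing was read.
    public double getLocalReadFraction() {
        long total = bytesRead.get();
        return total == 0 ? 0.0 : (double) localBytesRead.get() / total;
    }

    public static void main(String[] args) {
        LocalityStatistics stats = new LocalityStatistics("hdfs");
        stats.recordRead(300, true);   // local read
        stats.recordRead(100, false);  // remote read
        System.out.println(stats.getLocalBytesRead() + " of "
                + stats.getBytesRead() + " bytes read locally");
    }
}
```

With counters like these exposed at the abstraction layer, a framework could
compute locality per scheme without knowing which FileSystem it is talking to.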
> Enhance FileSystem.Statistics to have locality information
> ----------------------------------------------------------
>
> Key: HDFS-5098
> URL: https://issues.apache.org/jira/browse/HDFS-5098
> Project: Hadoop HDFS
> Issue Type: Improvement
> Reporter: Bikas Saha
> Fix For: 2.1.1-beta
>
>
> Currently in MR/Tez we don't have a good and accurate means to detect how much
> of the IO was actually done locally. Getting this information from the
> source of truth would be much better.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira