[ 
https://issues.apache.org/jira/browse/HDFS-5098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13745373#comment-13745373
 ] 

Bikas Saha commented on HDFS-5098:
----------------------------------

Its good to see that the information is already available for HDFS. However, in 
MR we work at the FileSystem abstraction layer. And we use 
FileSystem.getAllStatistics() to get the statistics. Hence, we cannot depend on 
calling HdfsDataInputStream. It would be great if that information was passed 
up the to the FileSystem.Statistics layer. I guessing thats which the 
Statistics object exists.
                
> Enhance FileSystem.Statistics to have locality information
> ----------------------------------------------------------
>
>                 Key: HDFS-5098
>                 URL: https://issues.apache.org/jira/browse/HDFS-5098
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Bikas Saha
>             Fix For: 2.1.1-beta
>
>
> Currently in MR/Tez we dont have a good and accurate means to detect how much 
> the the IO was actually done locally. Getting this information from the 
> source of truth would be much better.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to