[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14004424#comment-14004424
 ] 

Sandy Ryza commented on MAPREDUCE-5896:
---------------------------------------

Given HDFS's plans for hierarchical storage management, I think it would be 
good to make this extensible to handle storage media beyond memory.  I talked 
this over with [~andrew.wang] and [~atm] and we think the right interface would 
be something like a SplitLocationInfo class, with isInMemory() and isOnDisk() 
methods.  We can later add isInFlash() and possibly even getDisk() to return 
which disk the data is on. 

InputSplits would have a "SplitLocationInfo[] getLocationInfo()" method that 
returns info about how the data is stored on each host returned by 
getLocations().
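
To make the proposal concrete, here is a rough sketch of how such an interface might look. Only the names SplitLocationInfo, isInMemory(), isOnDisk(), getLocationInfo() and getLocations() come from the comment above. The constructor, the getHost() accessor, the SketchSplit stand-in for an InputSplit, and the inMemoryHosts() helper are assumptions made for illustration, not the committed Hadoop API.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of the proposed class; constructor and fields are assumptions. */
final class SplitLocationInfo {
    private final String host;
    private final boolean inMemory;
    private final boolean onDisk;

    SplitLocationInfo(String host, boolean inMemory, boolean onDisk) {
        this.host = host;
        this.inMemory = inMemory;
        this.onDisk = onDisk;
    }

    /** Host this entry describes (assumed accessor, not named in the proposal). */
    String getHost() { return host; }

    /** True if the split's data is cached in memory on this host. */
    boolean isInMemory() { return inMemory; }

    /** True if the split's data is stored on disk on this host. */
    boolean isOnDisk() { return onDisk; }
}

/** Stand-in for an InputSplit; only the two location methods are modeled. */
final class SketchSplit {
    private final SplitLocationInfo[] infos;

    SketchSplit(SplitLocationInfo... infos) {
        this.infos = infos.clone();
    }

    /** Hosts holding the split's data, in the same order as getLocationInfo(). */
    String[] getLocations() {
        String[] hosts = new String[infos.length];
        for (int i = 0; i < infos.length; i++) {
            hosts[i] = infos[i].getHost();
        }
        return hosts;
    }

    /** One entry per host returned by getLocations(). */
    SplitLocationInfo[] getLocationInfo() {
        return infos.clone();
    }
}

final class SplitLocationSketch {
    /** Hypothetical helper: hosts a scheduler might prefer because the data is in memory. */
    static List<String> inMemoryHosts(SketchSplit split) {
        List<String> hosts = new ArrayList<>();
        for (SplitLocationInfo info : split.getLocationInfo()) {
            if (info.isInMemory()) {
                hosts.add(info.getHost());
            }
        }
        return hosts;
    }
}
```

With this shape, adding isInFlash() or getDisk() later would only extend SplitLocationInfo and would not change the method on the split itself.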

> Allow InputSplits to indicate which locations have the block cached in memory
> -----------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5896
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5896
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 2.4.0
>            Reporter: Sandy Ryza
>            Assignee: Sandy Ryza
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)
