[ https://issues.apache.org/jira/browse/HADOOP-2187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12541943 ]
Doug Cutting commented on HADOOP-2187: -------------------------------------- +1 This sounds like a good change. I might instead call the class BlockLocations, and the methods getBlockLocations. When we deprecate the existing method ideally we can upgrade all existing implementations with a single back-compatibility implementation on the base class. Also, should we refer to hosts by hostname or IP here? > FileSystem should return location information with byte ranges > -------------------------------------------------------------- > > Key: HADOOP-2187 > URL: https://issues.apache.org/jira/browse/HADOOP-2187 > Project: Hadoop > Issue Type: Improvement > Components: fs > Reporter: Owen O'Malley > Fix For: 0.16.0 > > > The FileSystem interface should provide location information with byte ranges > rather than a String[][] of locations. I suggest that we deprecate > FileSystem.getFileCacheHints and replace it with: > {code} > abstract public class FileSystem { > ... > public static class BlockInformation implements Writable { > public BlockInformation(long start, String[] locations) {...} > public String[] getHosts() {...} > public long getStartingOffset() {...} > } > BlockInformation[] getFileLocations(Path f, long start, long length) { ... > } > } > {code} > This will allow us to fix the FileInputFormat in map/reduce to make just one > call per a file to the name node instead of one per a block. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.