[ http://issues.apache.org/jira/browse/HADOOP-451?page=comments#action_12430085 ] Doug Cutting commented on HADOOP-451: -------------------------------------
In a comment on HADOOP-372, Owen suggests that the getSplits() method should look like: Split[] getSplits(URI[], JobConf, int numMaps); I think that's a good idea. > Add a Split interface > --------------------- > > Key: HADOOP-451 > URL: http://issues.apache.org/jira/browse/HADOOP-451 > Project: Hadoop > Issue Type: Improvement > Components: mapred > Reporter: Doug Cutting > Fix For: 0.6.0 > > > The InputFormat interface has a method: > FileSplit[] getSplits(); > This should change to: > Split[] getSplits(); > The Split interface would look like: > public interface Split extends Writable { > /** Returns a list of hosts that contain this split. > This is only used to optimize task placement, so this may be empty. */ > String[] getLocations(FileSystem fs); > /** The relative, estimated cost of operating on this. Typically the size > of the data in the split. > Used to prioritize tasks in a job (high-cost tasks are run first). */ > long getCost(); > } -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira