[ http://issues.apache.org/jira/browse/HADOOP-451?page=all ]
Doug Cutting updated HADOOP-451:
--------------------------------
Status: Resolved (was: Patch Available)
Resolution: Fixed
I just committed this. Thanks, Owen!
> Add a Split interface
> ---------------------
>
> Key: HADOOP-451
> URL: http://issues.apache.org/jira/browse/HADOOP-451
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Affects Versions: 0.9.2
> Reporter: Doug Cutting
> Assigned To: Owen O'Malley
> Fix For: 0.10.0
>
> Attachments: input-split-2.patch, input-split.patch
>
>
> The InputFormat interface has a method:
> FileSplit[] getSplits();
> This should change to:
> Split[] getSplits();
> The Split interface would look like:
> public interface Split extends Writable {
> /** Returns a list of hosts that contain this split.
> This is only used to optimize task placement, so this may be empty. */
> String[] getLocations(FileSystem fs);
> /** The relative, estimated cost of operating on this. Typically the size
> of the data in the split.
> Used to prioritize tasks in a job (high-cost tasks are run first). */
> long getCost();
> }
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira