[
https://issues.apache.org/jira/browse/HADOOP-933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Owen O'Malley updated HADOOP-933:
---------------------------------
Status: Open (was: Patch Available)
I think a better solution would be to do HADOOP-867, which will have the client
write all of the input splits to disk. Then the MapTask would only need to keep
the Path to the split file and an offset. The only place the splits would be
instantiated would be the submitting program and the task jvm.
> Application defined InputSplits do not work
> -------------------------------------------
>
> Key: HADOOP-933
> URL: https://issues.apache.org/jira/browse/HADOOP-933
> Project: Hadoop
> Issue Type: Bug
> Components: mapred
> Affects Versions: 0.10.1
> Reporter: Benjamin Reed
> Fix For: 0.10.1
>
> Attachments: MapTask.patch
>
>
> If an application defines its own InputSplit, the task tracker chokes when it
> cannot deserialize the InputSplit when it deserializes MapTasks it receives
> from the JobTracker. This is because the TaskTracker does not resolve classes
> from the job jar file. The attached patch delays resolution of the InputSplit
> until it is running in the context of the child process where it can resolve
> the InputSplit class.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.