[jira] [Commented] (MAPREDUCE-207) Computing Input Splits on the MR Cluster

Gera Shegalov (JIRA) Tue, 27 May 2014 14:01:19 -0700

    [ https://issues.apache.org/jira/browse/MAPREDUCE-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010259#comment-14010259 ]


Gera Shegalov commented on MAPREDUCE-207:
-----------------------------------------

Assuming that the TestPipeApplication failure is MAPREDUCE-5868, v05 is ready 
for review. The code can be optimized further to avoid reading the splits back 
right after they are written for the first time. We can incorporate that if the 
approach is accepted in general. The existing job submission tests provide 
plenty of coverage and helped shape the patch. Since it is a mere refactoring, 
no new functional tests are urgently needed. 
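
To illustrate the possible optimization of not reading splits back after the 
first write, here is a minimal sketch. It is not taken from the v05 patch: the 
SplitStore class, its methods, and the use of plain strings as splits are all 
hypothetical, and an in-memory byte array stands in for the split file.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical sketch; none of these names come from the patch. */
final class SplitStore {
    private byte[] persisted;     // stands in for the serialized split file
    private List<String> cached;  // in-memory copy retained by the first write
    private int readBacks = 0;    // how many times we had to deserialize

    /** Serializes the splits and keeps the in-memory copy for later reads. */
    void write(List<String> splits) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeInt(splits.size());
            for (String split : splits) {
                out.writeUTF(split);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        persisted = bytes.toByteArray();
        cached = Collections.unmodifiableList(new ArrayList<>(splits));
    }

    /** Returns the cached splits if present; otherwise deserializes them. */
    List<String> read() {
        if (cached != null) {
            return cached; // written by this process: no read-back needed
        }
        if (persisted == null) {
            throw new IllegalStateException("no splits have been written");
        }
        readBacks++;
        List<String> result = new ArrayList<>();
        try (DataInputStream in =
                 new DataInputStream(new ByteArrayInputStream(persisted))) {
            int count = in.readInt();
            for (int i = 0; i < count; i++) {
                result.add(in.readUTF());
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        cached = Collections.unmodifiableList(result);
        return cached;
    }

    int readBackCount() {
        return readBacks;
    }
}
```

In this sketch a process that has just written the splits gets them from 
memory, so readBackCount() stays at zero; only a reader that did not write 
them would pay for deserialization.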

> Computing Input Splits on the MR Cluster
> ----------------------------------------
>
>                 Key: MAPREDUCE-207
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-207
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: applicationmaster, mrv2
>            Reporter: Philip Zeyliger
>            Assignee: Arun C Murthy
>         Attachments: MAPREDUCE-207.patch, MAPREDUCE-207.v02.patch, 
> MAPREDUCE-207.v03.patch, MAPREDUCE-207.v05.patch
>
>
> Instead of computing the input splits as part of job submission, Hadoop could 
> have a separate "job task type" that computes the input splits, therefore 
> allowing that computation to happen on the cluster.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
