[jira] [Commented] (MAPREDUCE-207) Computing Input Splits on the MR Cluster

Ming Ma (JIRA) Fri, 27 Jun 2014 15:52:26 -0700

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14046497#comment-14046497
 ]


Ming Ma commented on MAPREDUCE-207:
-----------------------------------

Thanks, Gera. Nice work and this will be quite useful. Overall it looks good. 
Per offline discussion with Gera,

1. It is unclear if there is any security related implication such as 
https://issues.apache.org/jira/browse/MAPREDUCE-5663.
2. The compatibility between new MR client with this feature and cluster with 
old MR. Given new MR client won't compute the split by default; the job will 
fail if the cluster still uses old MR. So in this case, new MR client needs to 
be configured to compute split. For a more general case where new MR client can 
talk to some cluster with old MR and some cluster with new MR, it will be nice 
if client can discover if the cluster supports this feature.

> Computing Input Splits on the MR Cluster
> ----------------------------------------
>
>                 Key: MAPREDUCE-207
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-207
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: applicationmaster, mrv2
>            Reporter: Philip Zeyliger
>            Assignee: Arun C Murthy
>         Attachments: MAPREDUCE-207.patch, MAPREDUCE-207.v02.patch, 
> MAPREDUCE-207.v03.patch, MAPREDUCE-207.v05.patch
>
>
> Instead of computing the input splits as part of job submission, Hadoop could 
> have a separate "job task type" that computes the input splits, therefore 
> allowing that computation to happen on the cluster.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (MAPREDUCE-207) Computing Input Splits on the MR Cluster

Reply via email to