[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-3902:
-------------------------------------

    Summary: MR AM should reuse containers for map tasks, there-by allowing 
fine-grained control on num-maps for users without need for 
CombineFileInputFormat etc.  (was: MR AM should reuse containers for map tasks)

bq. Is there a cap on the amount of re-use?  For example, if the container has 
been in use for more than 1 minute then do not re-use it.

Not currently, but we could add something like this - except it won't make too 
much difference since you need to run the remaining maps in other containers 
anyway! :)
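
If such a cap were added, it could be as small as the sketch below. This is only an illustration: `ContainerReuseCap` and `mayReuse` are hypothetical names, not part of the attached patch or of any Hadoop API, and the one-minute limit is simply the figure from the question above.

```java
import java.time.Duration;

/**
 * Hypothetical reuse cap (assumption for illustration only; not taken from
 * MAPREDUCE-3902.patch or from any Hadoop class).
 */
final class ContainerReuseCap {
    private final Duration maxInUse;

    ContainerReuseCap(Duration maxInUse) {
        this.maxInUse = maxInUse;
    }

    /**
     * Returns true if a container that has already been in use for
     * {@code inUse} may be handed another map task, i.e. it has not
     * been in use for longer than the configured cap.
     */
    boolean mayReuse(Duration inUse) {
        return inUse.compareTo(maxInUse) <= 0;
    }
}
```

With a one-minute cap, `new ContainerReuseCap(Duration.ofMinutes(1)).mayReuse(Duration.ofSeconds(90))` returns false, so that container would be released rather than given another map.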

bq. Or to rephrase, what prevents a cluster with a few large jobs from having 
hogged containers?

The central scheduler (e.g. CapacityScheduler) already uses queue capacities and 
user limits (and, in future, preemption) to prevent this.
                
> MR AM should reuse containers for map tasks, there-by allowing fine-grained 
> control on num-maps for users without need for CombineFileInputFormat etc.
> ------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3902
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3902
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: applicationmaster, mrv2
>            Reporter: Arun C Murthy
>            Assignee: Arun C Murthy
>         Attachments: MAPREDUCE-3902.patch
>
>
> The MR AM is now in a great position to reuse containers across (map) tasks. 
> This is something similar to JVM re-use we had in 0.20.x, but in a 
> significantly better manner:
> # Consider data-locality when re-using containers
> # Consider the new shuffle - ensure that reduces fetch output of the whole 
> container at once (i.e. all maps) 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
