[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795443#comment-13795443
 ] 

Arun C Murthy commented on MAPREDUCE-5583:
------------------------------------------

Cluster with 100,000 containers, 1,000 jobs, each with 100000 tasks, and 
specifies that they can only run 5 tasks. So, you are now only using 5% of the 
cluster and no one makes progress leading to very poor utilization and 
peanut-buttering effect.

Admittedly it's a contrived example and yes, I agree a user can hack his own AM 
to do this - but let's not make this trivial for normal users. This leads to 
all sorts of bad side-effects by supporting it out of the box.

Some form of admin control (e.g. queue with a max-cap) for a small number of 
use-cases where you *actually* need this feature is much safer.

> Ability to limit running map and reduce tasks
> ---------------------------------------------
>
>                 Key: MAPREDUCE-5583
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5583
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: mr-am, mrv2
>    Affects Versions: 0.23.9, 2.1.1-beta
>            Reporter: Jason Lowe
>
> It would be nice if users could specify a limit to the number of map or 
> reduce tasks that are running simultaneously.  Occasionally users are 
> performing operations in tasks that can lead to DDoS scenarios if too many 
> tasks run simultaneously (e.g.: accessing a database, web service, etc.).  
> Having the ability to throttle the number of tasks simultaneously running 
> would provide users a way to mitigate issues with too many tasks on a large 
> cluster attempting to access a serivce at any one time.
> This is similar to the functionality requested by MAPREDUCE-224 and 
> implemented by HADOOP-3412 but was dropped in mrv2.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to