[
https://issues.apache.org/jira/browse/HADOOP-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12688252#action_12688252
]
Vinod K V commented on HADOOP-5186:
-----------------------------------
One more issue with the current model of limiting by number of jobs per pool is
the under-utilization of map slots while reduce tasks are running. If N is the
maximum number of jobs that can run in a particular pool, when all the maps of
first N jobs are done and only the reduces are running, the map slots sit idle
and the maps of next N batch of jobs cannot yet be scheduled. The fix for this
is to change the limits to be in terms of tasks/slots instead of the number of
jobs.
> Improve limit handling in fairshare scheduler
> ---------------------------------------------
>
> Key: HADOOP-5186
> URL: https://issues.apache.org/jira/browse/HADOOP-5186
> Project: Hadoop Core
> Issue Type: Improvement
> Components: contrib/fair-share
> Reporter: Hemanth Yamijala
> Priority: Minor
>
> The fairshare scheduler has a way by which it can limit the number of jobs in
> a pool by setting the maxRunningJobs parameter in its allocations definition.
> This limit is treated as a hard limit, and comes into effect even if the
> cluster is free to run more jobs, resulting in underutilization. Possibly the
> same thing happens with the parameter maxRunningJobs for user and
> userMaxJobsDefault. It may help to treat these as a soft limit and run
> additional jobs to keep the cluster fully utilized.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.