[
https://issues.apache.org/jira/browse/HADOOP-5186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12671717#action_12671717
]
Hemanth Yamijala commented on HADOOP-5186:
------------------------------------------
bq. If all tasks from all initialized jobs in the pool are running, then
initialize further jobs.
Matei, waiting until all the jobs become running before initializing further jobs may
still result in some underutilization, because job initialization is expected to take
some time: it involves DFS access for localizing the job, as well as running the
'Setup task' (which the JobTracker handles transparently). Hence, maybe it is a
better idea to pre-initialize a few additional jobs (the number could be very small)
and keep that number a constant. This way we are still bounded, but also have a
backlog of initialized jobs to schedule tasks from immediately, if all other jobs
become running by then. Does this make sense?
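
A rough sketch of that constant-backlog idea (illustrative only; the Pool/Job types,
their accessors, and PRE_INIT_BACKLOG below are stand-ins, not the actual fair
scheduler code):

{code:java}
import java.util.List;

/**
 * Illustration of keeping a constant backlog of pre-initialized jobs per pool.
 * The Pool and Job interfaces are simplified stand-ins, not the real
 * fair scheduler types.
 */
public class PreInitBacklogSketch {

  /** Simplified view of a job, for illustration only. */
  interface Job {
    boolean isInitialized();
    boolean allTasksRunning();
  }

  /** Simplified view of a pool, for illustration only. */
  interface Pool {
    List<Job> getJobsInSchedulingOrder();
  }

  /** Constant number of initialized-but-not-fully-running jobs kept in reserve. */
  static final int PRE_INIT_BACKLOG = 2;

  /** Number of additional jobs whose initialization should be kicked off now. */
  static int jobsToInitialize(Pool pool) {
    int backlog = 0;        // initialized jobs that can still hand out tasks
    int uninitialized = 0;  // jobs waiting for initialization
    for (Job job : pool.getJobsInSchedulingOrder()) {
      if (!job.isInitialized()) {
        uninitialized++;
      } else if (!job.allTasksRunning()) {
        backlog++;
      }
    }
    // Keep the backlog topped up to a constant, so the slow parts of
    // initialization (DFS localization, the setup task) overlap with the
    // jobs that are already running instead of stalling the pool.
    return Math.min(uninitialized, Math.max(0, PRE_INIT_BACKLOG - backlog));
  }
}
{code}

With PRE_INIT_BACKLOG = 2, for example, a pool always tries to have two initialized
jobs whose tasks are not all running yet, so initialization latency overlaps with the
execution of the jobs ahead of them while the total number of initialized jobs stays
bounded.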
> Improve limit handling in fairshare scheduler
> ---------------------------------------------
>
> Key: HADOOP-5186
> URL: https://issues.apache.org/jira/browse/HADOOP-5186
> Project: Hadoop Core
> Issue Type: Improvement
> Components: contrib/fair-share
> Reporter: Hemanth Yamijala
> Priority: Minor
>
> The fairshare scheduler can limit the number of running jobs in a pool by setting
> the maxRunningJobs parameter in its allocations definition. This limit is treated
> as a hard limit, and comes into effect even if the cluster is free to run more
> jobs, resulting in underutilization. Possibly the same thing happens with the
> per-user maxRunningJobs parameter and userMaxJobsDefault. It may help to treat
> these as soft limits and run additional jobs to keep the cluster fully utilized.
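
A minimal sketch of what a soft limit could mean here (illustrative only; the class,
method, and the slot-based check are assumptions, not the scheduler's actual API):
the configured cap is enforced while the cluster is busy, but may be exceeded when
slots would otherwise sit idle.

{code:java}
/**
 * Sketch of treating maxRunningJobs as a soft limit. All names here are
 * illustrative assumptions, not the fair scheduler's actual API.
 */
public class SoftLimitSketch {

  /**
   * Decide whether one more job from a pool (or user) may start running.
   *
   * @param runningJobs    jobs currently running under this limit
   * @param maxRunningJobs the configured maxRunningJobs value
   * @param freeTaskSlots  unused map and reduce slots across the cluster
   */
  static boolean canStartAnotherJob(int runningJobs,
                                    int maxRunningJobs,
                                    int freeTaskSlots) {
    if (runningJobs < maxRunningJobs) {
      return true;  // still within the configured limit
    }
    // Soft-limit behaviour: exceed the limit only when slots would otherwise
    // sit idle, so the cap still shapes sharing under contention but no
    // longer forces underutilization on a free cluster.
    return freeTaskSlots > 0;
  }
}
{code}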