[ 
https://issues.apache.org/jira/browse/HADOOP-5659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12718665#action_12718665
 ] 

Matei Zaharia commented on HADOOP-5659:
---------------------------------------

Hi Dhruba,

A question about this feature: How do you want to decide when to kill the 
wellcare job? Do you want the tasks specified as a %, as a fixed number, or 
what?

Matei

> Fair share schduler may support preemption only with a specific pool
> --------------------------------------------------------------------
>
>                 Key: HADOOP-5659
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5659
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: contrib/fair-share
>            Reporter: dhruba borthakur
>
> There are a set of jobs that helps to keep the cluster resources being used 
> optimally. For example, there are data sets that are made of a multiple files 
> in a directory. These part-xxx files could be concatenated to a relatively 
> few files (to reduce memory  pressure on the namenode). Also, there are files 
> that could be compressed more efficiently (e.g. bzip2) to reduce save on disk 
> usage. These are kind of system-wellcare jobs that should run only if it does 
> not impact any other "real" user of the cluster. On an idle cluster, these 
> wellcare jobs should use all availale system resources. When a real user 
> submits a job, the wellcare job(s) should be pre-empted. If a scheduler can 
> support pre-emption only for jobs in a specified pool, then I can submit 
> these well-care jobs to that special pool. Real user's jobs will never get 
> pre-empted;but the wellcare jobs can get pre-empted as soon as there is 
> resource contention. If a task of well-care jobs is pre-empted more than a 
> configured max, the entire wellcare job will fail.. that this is the 
> behaviour I want. The wellcare jobs would run in idle slots as long as all 
> user-submitted jobs have been satisfied, but would be preempted as soon as 
> user jobs require any of those slots.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to