[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Chen resolved MAPREDUCE-2205.
-----------------------------------

    Resolution: Not A Problem

> FairScheduler should not re-schedule jobs that have just been preempted
> -----------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2205
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2205
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/fair-share
>            Reporter: Joydeep Sen Sarma
>            Assignee: Scott Chen
>
> We have hit a problem with the preemption implementation in the FairScheduler 
> where the following happens:
> # job X runs short of fair share or min share and requests/causes N tasks to 
> be preempted
> # when slots are then scheduled - tasks from some other job are actually 
> scheduled
> # after preemption_interval has passed, job X finds it's still underscheduled 
> and requests preemption. goto 1.
> This has caused widespread preemption of tasks and the cluster going from 
> high utilization to low utilization in a few minutes.
> After doing some analysis of the logs - one of the biggest contributing 
> factors seems to be the scheduling of jobs when a heartbeat with multiple 
> slots is advertised. currently it goes over all the jobs/pools (in sorted) 
> order until all the slots are exhausted. this leads to lower priority jobs 
> also getting scheduled (that may have just been preempted).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to