[ 
https://issues.apache.org/jira/browse/MAPREDUCE-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752105#action_12752105
 ] 

Vinod K V commented on MAPREDUCE-944:
-------------------------------------

I see in the patch attached that only one concrete implementation 
CapBasedLoadManager is done for the LoadManager which in turn doesn't take into 
account any resource usage. I guess you are planning a proper implementation 
for this feature regarding fair-share of memory usage in another JIRA.

Some points still not dealt with in this JIRA. I bring about these points so as 
to know if you are thinking or have already thought anything about this.
 - Job configuration about how users specify the resource usage. Some memory 
related configuration properties are added to the framework while working for 
memory monitoring on TTs as well as memory usage based scheduling in 
CapacityTaskScheduler. You may want to reuse some/all of it.
 - Capturing the scheduling decisions involved when we are not able to find a 
task from a Schedulable because of lack of resources on a given TaskTasker.

Regarding the latter, the current patch just returns null, which is similar to 
the decision CapacityTaskScheduler used to take in previous versions - i.e. 
block the TT till it can be given a task from the job at the head of the 
queue/pool. Sometime back, we investigated how this approach works with 
FairScheduler and realized some important implications. For e.g, because the 
order of jobs might change significantly in consecutive iterations of 
FairScheduler, just returning null may not work at all. Eventually we may end 
up waiting for a long time if significant number of jobs ask for high amount of 
resources.

Thoughts?

> Extend FairShare scheduler to fair-share memory usage in the cluster
> --------------------------------------------------------------------
>
>                 Key: MAPREDUCE-944
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-944
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: contrib/fair-share
>            Reporter: dhruba borthakur
>         Attachments: LoadManager.txt
>
>
> The FairShare Scheduler has an extensible LoadManager API to regulate 
> allocating new tasks on a particular TaskTracker. In similar lines, it would 
> be nice if the FairShare Scheduler can have a pluggable policy to regulate 
> new tasks from a particular job. This will allow one to skip scheduling tasks 
> of a job that  is eating a large percentage of memory in the cluster, i.e. 
> fair-share of memory resources among jobs. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to