[ 
https://issues.apache.org/jira/browse/HADOOP-4439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12640465#action_12640465
 ] 

Hemanth Yamijala commented on HADOOP-4439:
------------------------------------------

The following changes are identified:

- We remove the concept of having a default memory per task on the TT, 
introduced in HADOOP-3759 as the max memory per TT / number of slots. The 
problem with this model is that in a heterogenous cluster, different TTs could 
give different default memory per task values for the same job, which is 
confusing.
- Instead, we introduce a default memory per task configuration variable that 
is expected to be controlled by the cluster admin. This is the value that will 
be used for a job which does not specify any memory requirements. The advantage 
with this model is that it eases configuration, and makes the default value 
consistent for the users.
- If a job has not specified any memory requirements, this variable would be 
set to the job's configuration, maybe via the {{Task}} object.
- We modify the algorithm of protecting RAM limits introduced in HADOOP-3581 to 
use the configured memory per task instead of the default memory per task.
- We remove the reporting of the default memory per task, introduced in 
HADOOP-3759, done via the {{TaskTrackerStatus.ResourceStatus}}. Instead we 
report the total memory available on the TT instead of the default memory per 
task.
- When HADOOP-4053 is fixed, the above values would be used to schedule tasks.
- However until HADOOP-4053 is fixed, these configuration parameters and the 
corresponding {{JobConf}} variables should not be exposed in any public API or 
documentation, as they could confuse users. This can be turned on after Hadoop 
0.19

> Cleanup memory related resource management
> ------------------------------------------
>
>                 Key: HADOOP-4439
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4439
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Hemanth Yamijala
>            Assignee: Hemanth Yamijala
>            Priority: Blocker
>             Fix For: 0.19.0
>
>
> HADOOP-3759 and HADOOP-3581 introduced memory based resource management. This 
> JIRA is to cleanup certain aspects of the two issues that came up while doing 
> HADOOP-4035, which is filed to support memory based scheduling 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to