[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hemanth Yamijala updated MAPREDUCE-1018:
----------------------------------------

    Attachment: MAPRED-1018-7.patch.txt

The attached patch, MAPRED-1018-7.patch.txt, makes changes to the memory 
monitoring, configuration and scheduling sections. The changes over earlier 
patches are primarily presentational.

A brief summary:

- Mapreduce Tutorial: This now describes all the job-specific memory 
configuration options. Users who have questions about how to configure memory 
requirements for their jobs should be able to find answers here.

- Cluster Setup: This describes memory monitoring, links to the job-specific 
options in the Mapreduce tutorial, and describes in detail how to configure 
the cluster-specific memory options. I've removed the duplicated description 
of the job-specific options from the two places, since keeping both copies in 
sync would be a maintenance problem. This section covers the memory-related 
aspects from an administrator's point of view.

- Capacity Scheduler: This describes memory-based scheduling. Instead of 
spelling out the precise algorithm, I have given a gist of how the scheduler 
works. The description is in terms of *what* the scheduler does, rather 
than *how* it does it.

Please review the patch. In particular, please check that all the required 
content is captured, that I have not missed anything while reorganizing, and 
that the documentation is easy to understand.

One thing I've not included is documentation for the RSS-based monitoring 
introduced in MAPREDUCE-1221. I am not yet familiar with that part of the 
code, and this patch is already reasonably big. Hence, I would request that 
those changes be incorporated as a follow-up, though they should also be 
treated as a blocker for the 0.21 release. Thoughts?

> Document changes to the memory management and scheduling model
> --------------------------------------------------------------
>
>                 Key: MAPREDUCE-1018
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1018
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: documentation
>    Affects Versions: 0.21.0
>            Reporter: Hemanth Yamijala
>            Assignee: Hemanth Yamijala
>            Priority: Blocker
>             Fix For: 0.21.0
>
>         Attachments: MAPRED-1018-1.patch, MAPRED-1018-2.patch, 
> MAPRED-1018-3.patch, MAPRED-1018-4.patch.txt, MAPRED-1018-5.patch.txt, 
> MAPRED-1018-6.patch.txt, MAPRED-1018-7.patch.txt, MAPRED-1018-commons.patch
>
>
> Changes were made to the configuration, monitoring and scheduling of 
> high-RAM jobs. These must be documented in mapred-defaults.xml and in the 
> Forrest documentation.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
