[ https://issues.apache.org/jira/browse/HADOOP-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12539186 ]
Nigel Daley commented on HADOOP-1917: ------------------------------------- Here is feedback on the first half of the mapred tutorial from HADOOP-1917_2_20071031.patch: "serve as a Tutorial" -> "serve as a tutorial" up-and-running -> running parallelly -> in parallel built of commodity -> built with commodity which processed -> which are processed in completely -> in a completely The frameworks sorts -> The framework sorts in a FileSystem -> in a filesystem re-executes the failed ones -> re-execution of the failed ones Normally, the -> Typically the Hence the framework -> This configuration enables the framework to of a master -> of a single master per node in the cluster -> per cluster node scheduling the jobs' -> scheduling the job's interfaces/classes -> interfaces or abstract classes This, and other facets -> These, and other parameter & monitoring -> and monitoring (appears in a number of places) to the job-client etc. -> to the job client. (either remove "etc." or expand it out to list more items sent to the job client) make Hadoop Streaming and Hadoop Pipes sentences bullet points. I haven't compiled the forrest. Do these type of urls work? api/index.html?org/apache/hadoop/streaming/package-summary.html and/or the reducer. -> and/or the reducer function. try to avoid <code>interface or class name</code>s (followed by an s). The <code>key</code>s and <code>value</code>s -> The key and value classes Additionally the <code>key</code>s -> Additionally, the key class have to be -> have to implement (then remove trailing 's' from WritableComparable) Input & Output -> Input and Output Lets walk through a simple Map-Reduce application before we jump into details to get a flavour for how they work. -> Before jumping into details, lets walk through a simple Map-Reduce example to get a flavour for how they work. WalkThrough -> Walk-through perhaps you should first talk about what inputs are passed to the map method. line nos. -> lines (IMO this simplifies the reading) line no. -> line line# -> line output of the each -> output of each (same as the -> (the combiner is the same as the you don't introduce the concept of a combiner -- that may need more explanation (or leave it out of this tutorial) (word) -> (or word in this example) of the program -> method with the given -> method with the given interfaces/classes -> interfaces and classes (appears in a number of places in different orders) > Need configuration guides for Hadoop > ------------------------------------ > > Key: HADOOP-1917 > URL: https://issues.apache.org/jira/browse/HADOOP-1917 > Project: Hadoop > Issue Type: Improvement > Components: conf > Affects Versions: 0.14.1 > Reporter: Sameer Paranjpye > Assignee: Arun C Murthy > Priority: Critical > Fix For: 0.16.0 > > Attachments: HADOOP-1917_1_20071025.patch, > HADOOP-1917_2_20071031.patch, HADOOP-1917_3_20071031.patch > > > We've recently had a spate of questions on the users list regarding features > such as rack-awareness, the trash can etc. which are not clearly documented > from a user/admins perspective. There is some Javadoc present but most of the > "documentation" exists either in JIRA or in the default config files > themselves. > We should generate top down configuration and use guides for map/reduce and > HDFS. These should probably be in forest and accessible from the project > website (Javadoc isn't always approachable to our non-programmer audience). > Committers should look for user documentation before accepting patches. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.