[
https://issues.apache.org/jira/browse/HADOOP-6616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13471805#comment-13471805
]
Joep Rottinghuis commented on HADOOP-6616:
------------------------------------------
Update looks good. I ran the perl and python example scripts. The first two ran
fine. See comment below on the BASH script.
Nit: (line 18 of the patch)
The <code>NameNode</code> and the <code>JobTracker</code> obtains
NN and JT obtain (singular).
I think this same error already exists in
http://hadoop.apache.org/docs/r1.0.3/cluster_setup.html#Hadoop+Rack+Awareness
One refinement to
{noformat}
The jobtracker uses rack awareness to reduce network transfers of HDFS data
blocks, as it will schedule tasks on nodes located within the same rack
containing the needed HDFS data blocks.
{noformat}
If the tasks cannot be scheduled on the DNs containing the needed HDFS blocks,
then the tasks will be scheduled on the same rack to reduce network transfers
if possible.
Line 41 again the NN and JT obtain instead of obtains (plural).
Line 65:
{noformat}
Hadoop will send multiple IP addresses on STDIN when forking the topology
script.
{noformat}
I think IP addresses are passed as arguments, not on STDIN. The first Perl
script reads this correctly from ARGV but the comment in the script reads that
it gets it from STDIN.
The BASH example for flat network always returns /rack-unkown
I think that is due to
{noformat}
if [ -n $# ];
{noformat}
Something like
{noformat}
if [ "$1" == "" ]; then
{noformat}
Once that is fixed, the script errors out missing closing brace in the for
statement. This should be:
{noformat}
for host in ${BASH_ARGV[*]}; do
{noformat}
I had some trouble with mis-matching single quote (which is strange as they
occurred only in the comments).
Same STDIN comment in other scripts.
I could not get the last python script (the one that makes assumptions about
the physical environment) to work because I do not have hosts that are called
"dn" something.
> Improve documentation for rack awareness
> ----------------------------------------
>
> Key: HADOOP-6616
> URL: https://issues.apache.org/jira/browse/HADOOP-6616
> Project: Hadoop Common
> Issue Type: Improvement
> Components: documentation
> Reporter: Jeff Hammerbacher
> Labels: newbie
> Attachments: hadoop-6616.patch, hadoop-6616.patch.2
>
>
> The current documentation for rack awareness
> (http://hadoop.apache.org/common/docs/r0.20.0/cluster_setup.html#Hadoop+Rack+Awareness)
> should be augmented to include a sample script.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira