[
https://issues.apache.org/jira/browse/HADOOP-1638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12514425
]
Tom White commented on HADOOP-1638:
-----------------------------------
This problem was caused by the changes made in Amazon EC2 addressing:
previously instances were direct addressed (given a single IP routable address)
and now they are NAT-addressed (by default, for later tool versions). The key
point is that NAT-addressed instances can't access other NAT-addressed
instances using the public address. Direct addressing is going to be phased
out. See
http://developer.amazonwebservices.com/connect/entry.jspa?externalID=682&categoryID=100
for more details.
Tools versions ec2-api-tools-1.2-9739 and later use NAT addressing, and I have
been using ec2-api-tools-1.2-7546 (although I thought I had been using
ec2-api-tools-1.2-9739) which still uses direct addressing.
I don't think HADOOP-1202 will make this a non-issue since EC2 NAT instances
cannot route to the public address of other instances. So even if the namenode
and job tracker could bind to the public address that would not be much help to
the slaves since they have to connect to the internal address - so this patch
would still be needed.
Stu, I agree that it would be nice to fix this problem more thoroughly but
until we have a better solution I think this approach is fine.
I've tested with the last three versions of ec2-api-tools and have successfully
run the grep example on small multi-node clusters. When NAT-addressing is used
however the webservers on datanodes and task trackers are not accessible since
non-routable addresses are used. Apart from this limitation (which can be
worked around by logging in to the relevant machine to browse logs) jobs ran OK.
So I vote to commit this (along with HADOOP-1635, HADOOP-1634) - I'll have some
time to do this tomorrow.
> Master node unable to bind to DNS hostname
> ------------------------------------------
>
> Key: HADOOP-1638
> URL: https://issues.apache.org/jira/browse/HADOOP-1638
> Project: Hadoop
> Issue Type: Bug
> Components: contrib/ec2
> Affects Versions: 0.13.0, 0.13.1, 0.14.0, 0.15.0
> Reporter: Stu Hood
> Priority: Minor
> Fix For: 0.13.1, 0.14.0, 0.15.0
>
> Attachments: hadoop-1638.patch
>
>
> With a release package of Hadoop 0.13.0 or with latest SVN, the Hadoop
> contrib/ec2 scripts fail to start Hadoop correctly. After working around
> issues HADOOP-1634 and HADOOP-1635, and setting up a DynDNS address pointing
> to the master's IP, the ec2/bin/start-hadoop script completes.
> But the cluster is unusable because the namenode and tasktracker have not
> started successfully. Looking at the namenode log on the master reveals the
> following error:
> {quote}
> 2007-07-19 16:54:53,156 ERROR org.apache.hadoop.dfs.NameNode:
> java.net.BindException: Cannot assign requested address
> at sun.nio.ch.Net.bind(Native Method)
> at
> sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:119)
> at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:59)
> at org.apache.hadoop.ipc.Server$Listener.<init>(Server.java:186)
> at org.apache.hadoop.ipc.Server.<init>(Server.java:631)
> at org.apache.hadoop.ipc.RPC$Server.<init>(RPC.java:325)
> at org.apache.hadoop.ipc.RPC.getServer(RPC.java:295)
> at org.apache.hadoop.dfs.NameNode.init(NameNode.java:164)
> at org.apache.hadoop.dfs.NameNode.<init>(NameNode.java:211)
> at org.apache.hadoop.dfs.NameNode.createNameNode(NameNode.java:803)
> at org.apache.hadoop.dfs.NameNode.main(NameNode.java:811)
> {quote}
> The master node refuses to bind to the DynDNS hostname in the generated
> hadoop-site.xml. Here is the relevant part of the generated file:
> {quote}
> <property>
> <name>fs.default.name</name>
> <value>blah-ec2.gotdns.org:50001</value>
> </property>
> <property>
> <name>mapred.job.tracker</name>
> <value>blah-ec2.gotdns.org:50002</value>
> </property>
> {quote}
> I'll attach a patch against hadoop-trunk that fixes the issue for me, but I'm
> not sure if this issue is something that someone can fix more thoroughly.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.