I'm trying to install hadoop on our linux machine but after
start-all.sh none of the slaves can connect:

2008-07-22 16:35:27,534 INFO org.apache.hadoop.dfs.DataNode: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting DataNode
STARTUP_MSG:   host = thetis/127.0.0.1
STARTUP_MSG:   args = []
STARTUP_MSG:   version = 0.16.4
STARTUP_MSG:   build = http://svn.apache.org/repos/asf/hadoop/core/branches/bran
ch-0.16 -r 652614; compiled by 'hadoopqa' on Fri May  2 00:18:12 UTC 2008
************************************************************/
2008-07-22 16:35:27,643 WARN org.apache.hadoop.dfs.DataNode: Invalid directory i
n dfs.data.dir: directory is not writable: /work
2008-07-22 16:35:27,699 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 1 time(s).
2008-07-22 16:35:28,700 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 2 time(s).
2008-07-22 16:35:29,700 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 3 time(s).
2008-07-22 16:35:30,701 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 4 time(s).
2008-07-22 16:35:31,702 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 5 time(s).
2008-07-22 16:35:32,702 INFO org.apache.hadoop.ipc.Client: Retrying connect to s
erver: hermes.cse.sc.edu/129.252.130.148:9000. Already tried 6 time(s).

same for the tasktrackers (port 9001).

I think the problem has something to do with name resolution. Check these out:

[EMAIL PROTECTED]:~/hadoop-0.16.4> telnet hermes.cse.sc.edu 9000
Trying 127.0.0.1...
Connected to hermes.cse.sc.edu (127.0.0.1).
Escape character is '^]'.
bye
Connection closed by foreign host.

[EMAIL PROTECTED]:~/hadoop-0.16.4> host hermes.cse.sc.edu
hermes.cse.sc.edu has address 129.252.130.148

[EMAIL PROTECTED]:~/hadoop-0.16.4> telnet 129.252.130.148 9000
Trying 129.252.130.148...
telnet: connect to address 129.252.130.148: Connection refused
telnet: Unable to connect to remote host: Connection refused

So, the first one connects but not the second one, but they both go to
the same machine:port. My guess is that the hadoop server is closing
the connection, but why?

Thanks,
Jose

-- 
Jose M. Vidal <[EMAIL PROTECTED]> http://jmvidal.cse.sc.edu
University of South Carolina http://www.multiagent.com

Reply via email to