I'm trying to cluster Nutch with Hadoop on two machines. When I run the crawl
command, I get a Connection refused error on the screen and the following
NoRouteToHostException in the slave log files. The processes on the slave
machine attempt to start but crash within seconds. What could cause this?
SSH is working fine. Is Hadoop trying to use the telnet command? The telnet
command produces a No route to host message. I've successfully run a crawl
on a standalone machine; it's only the clustering that is not working.
Any help would be appreciated.
Logs
=====
~daclark/nutch/search/nutch-0.9/logs/hadoop-daclark-tasktracker-<MY.MACHINE.NAME>.log
~daclark/nutch/search/nutch-0.9/logs/hadoop-daclark-datanode-<MY.MACHINE.NAME>.log
Errors
======
2007-07-02 11:14:26,937 ERROR mapred.TaskTracker - Can not start task tracker because java.net.NoRouteToHostException: No route to host
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
at java.net.Socket.connect(Socket.java:519)
at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:149)
at org.apache.hadoop.ipc.Client.getConnection(Client.java:531)
at org.apache.hadoop.ipc.Client.call(Client.java:458)
at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:163)
at org.apache.hadoop.mapred.$Proxy0.getProtocolVersion(Unknown Source)
at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:247)
at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:226)
at org.apache.hadoop.mapred.TaskTracker.initialize(TaskTracker.java:317)
at org.apache.hadoop.mapred.TaskTracker.<init>(TaskTracker.java:476)
at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:1589)
2007-07-02 11:14:23,930 ERROR dfs.DataNode - java.net.NoRouteToHostException: No route to host
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
at java.net.Socket.connect(Socket.java:519)
at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:149)
at org.apache.hadoop.ipc.Client.getConnection(Client.java:531)
at org.apache.hadoop.ipc.Client.call(Client.java:458)
at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:163)
at org.apache.hadoop.dfs.$Proxy0.getProtocolVersion(Unknown Source)
at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:247)
at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:226)
at org.apache.hadoop.dfs.DataNode.<init>(DataNode.java:252)
at org.apache.hadoop.dfs.DataNode.<init>(DataNode.java:198)
at org.apache.hadoop.dfs.DataNode.makeInstance(DataNode.java:1153)
at org.apache.hadoop.dfs.DataNode.run(DataNode.java:1081)
at org.apache.hadoop.dfs.DataNode.runAndWait(DataNode.java:1111)
at org.apache.hadoop.dfs.DataNode.main(DataNode.java:1275)
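
Both traces fail inside java.net.Socket.connect, so the daemons are opening an
ordinary TCP connection rather than calling telnet. A small Java probe run
from the slave should reproduce the same check that telnet performs. This is
only a sketch: the class name PortProbe, the default host "master-host" and the
default port 9000 are hypothetical placeholders, and the real values would be
whatever fs.default.name and mapred.job.tracker are set to in hadoop-site.xml.
It assumes Java 7 or later for try-with-resources.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.net.InetSocketAddress;
import java.net.NoRouteToHostException;
import java.net.Socket;
import java.net.SocketTimeoutException;
import java.net.UnknownHostException;

// Hypothetical helper, not part of Nutch or Hadoop.
class PortProbe {

    // Attempts one TCP connection and classifies the outcome.
    static String probe(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return "open";
        } catch (NoRouteToHostException e) {
            // Same exception as in the slave logs: the host could not be
            // reached, commonly because of a firewall or a routing problem.
            return "no-route";
        } catch (ConnectException e) {
            // The host answered, but nothing is listening on that port.
            return "refused";
        } catch (SocketTimeoutException e) {
            // No reply at all within timeoutMs.
            return "timeout";
        } catch (UnknownHostException e) {
            // The host name did not resolve.
            return "unknown-host";
        } catch (IOException e) {
            return "error: " + e;
        }
    }

    public static void main(String[] args) {
        // Placeholder defaults; pass the real master host and port as arguments.
        String host = args.length > 0 ? args[0] : "master-host";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 9000;
        System.out.println(host + ":" + port + " -> " + probe(host, port, 3000));
    }
}
```

Running it on the slave against each master port would show whether the
failure is a "no-route" result (the connection is blocked or unroutable before
it reaches the master) or a "refused" result (the master is reachable but the
daemon is not listening on that port). SSH working only shows that port 22 is
reachable; it says nothing about the Hadoop ports.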
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
5209 Nanticoke Court
Centreville, VA 20120
Cell - (703) 403-0340
Email - [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~