Hi Ken, I am not such an expert on linux, however I have solved this problem with ssh key based authentication.
http://www.cyberciti.biz/nixcraft/vivek/blogger/2004/05/ssh-public-key-based-authentication.php after this operation you should be able to connect to all your nodes from your namenode/jobtracker Hope it helps. G. On Sat, 2006-01-14 at 09:33 -0800, Ken Krugler wrote: > Hi all, > > We've got the nutch-2006-01-12.tar version of Nutch, and are trying > to run it on three machines. > > 192.168.0.100 is the "master" machine, where we run the JobTracker > and NameNode processes. > > 192.168.0.101 and 192.168.0.103 are the "slave" machines, where we > rung the TaskTracker and DataNode processes. > > When we fire off the the Nuch daemons with ./bin/start-all.sh, we get > the following error right away: > > 192.168.0.103: rsync from 192.168.0.100:/home/crawler/nutch > 192.168.0.103: Host key verification failed. > 192.168.0.103: rsync: connection unexpectedly closed (0 bytes > received so far) [receiver] > 192.168.0.103: rsync error: error in rsync protocol data stream (code > 12) at io.c(420) > 192.168.0.103: starting datanode, logging to > /home/crawler/tmp/logs/nutch-crawler-datanode-crawlerw3.log > > When I dump the datanode logfile from this 192.168.0.103 machine, I get: > > 060114 082814 10 parsing file:/home/crawler/nutch/conf/nutch-default.xml > 060114 082814 10 parsing file:/home/crawler/nutch/conf/nutch-site.xml > 060114 082814 10 Opened server at 50010 > 060114 082814 11 Starting DataNode in: /home/crawler/tmp/ndfs/data > 060114 082814 11 using BLOCKREPORT_INTERVAL of 3314538msec > 060114 082814 11 Exception: java.net.ConnectException: Connection refused > 060114 082814 11 Lost connection to namenode. Retrying... > 060114 082819 11 using BLOCKREPORT_INTERVAL of 3314538msec > 060114 082819 11 Exception: java.net.ConnectException: Connection refused > 060114 082819 11 Lost connection to namenode. Retrying... > 060114 082824 11 using BLOCKREPORT_INTERVAL of 3314538msec > 060114 082824 11 Exception: java.net.ConnectException: Connection refused > 060114 082824 11 Lost connection to namenode. Retrying... > 060114 082829 11 using BLOCKREPORT_INTERVAL of 3314538msec > 060114 082829 12 Client connection to 192.168.0.100:8009: starting > > When I dump the tasktracker logfile from 192.168.0.103, I get: > > 060114 082832 parsing file:/home/crawler/nutch/conf/nutch-default.xml > 060114 082832 parsing file:/home/crawler/nutch/conf/nutch-site.xml > 060114 082832 Server listener on port 50050: starting > 060114 082832 Server handler 0 on 50050: starting > 060114 082832 Server handler 1 on 50050: starting > 060114 082832 Server listener on port 50040: starting > 060114 082832 Server handler 0 on 50040: starting > 060114 082832 Server handler 1 on 50040: starting > 060114 082832 Lost connection to JobTracker > [main1/192.168.0.100:8010]. ex=java.net.ConnectException: Connection > refused Retrying... > > It seems like the 192.168.0.103 machine doesn't have the right > settings for connecting to the 192.168.0.100 machine. Is there a way > to check this outside of running Nutch? > > Thanks, > > -- Ken ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
