Well, that's kinda what's happening. I checked and the tasktrackers are running like you said. However, when I follow the tutorial at http://wiki.media-style.com/display/nutchDocu/setup+a+map+reduce+multi +box+system When I get to command to generate segments and check the generated segment, the command bin/nutch ndfs -ls segments always returns 0 results. It's supposed to give me a file like 20051214001226. In addition, that connection refused thing occurs when I try to generate the segments to crawldb.
On Wed, 2005-12-14 at 14:02 -0800, Matt Zytaruk wrote: > I get the same error sometimes, although everything seems to work fine > after that, even though it gives that error, so it's probably not a problem. > > - Matt Zytaruk > > > Michael Taggart wrote: > > >I've followed the steps in the media-style wiki for setting up a map > >reduce system. I am only having one strange error when I attempt to > >start the tasktrackers. Here is my output: > > > >[EMAIL PROTECTED] nutch]# bin/nutch-daemon.sh start tasktracker > >starting tasktracker, logging > >to /usr/local/nutch/nutch-root-tasktracker-srv08.xxxxx.com.log > >051214 133808 parsing file:/usr/local/nutch/conf/nutch-default.xml > >051214 133808 parsing file:/usr/local/nutch/conf/nutch-site.xml > >051214 133808 Server listener on port 50050: starting > >051214 133808 Server handler 0 on 50050: starting > >051214 133808 Server handler 1 on 50050: starting > >051214 133808 Server listener on port 50040: starting > >051214 133808 Server handler 0 on 50040: starting > >051214 133808 Server handler 1 on 50040: starting > >java.net.ConnectException: Connection refused > > at java.net.PlainSocketImpl.socketConnect(Native Method) > > > >I have configured my nutch-site.xml as follows on each box: > >?xml version="1.0"?> > ><?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?> > > > ><!-- Put site-specific property overrides in this file. --> > > > ><nutch-conf> > ><property> > > <name>fs.default.name</name> > > <value>srv05.xxxxx.com:50000</value> > > <description>The name of the default file system. Either the > > literal string "local" or a host:port for NDFS.</description> > ></property> > ><property> > > <name>mapred.job.tracker</name> > > <value>srv05.xxxxx.com:50020</value> > > <description>The host and port that the MapReduce job tracker runs > > at. If "local", then jobs are run in-process as a single map > > and reduce task. > > </description> > ></property> > ></nutch-conf> > > > >srv05 is my namenode and jobtracker. That server starts up the namenode > >and jobtracker services just fine. Maybe I am supposed to reference > >fs.default.name as srv08:50000 on srv08? I thought from reading the > >mediawiki that I need to reference my BoxA on every other machine. > >No firewall on this internal network so I am wondering why I am getting > >a connection refused. Anyone have any ideas? > >Thanks, > >Mike > > > > > > > > >
