OK, I think I have narrowed the problem down. It turns out the jobtracker
was never actually running on my BoxA. When I start it, I get this
error at the end of my jobtracker log:
Exception in thread "main" java.lang.NullPointerException
    at org.apache.nutch.mapred.JobTrackerInfoServer.<init>(JobTrackerInfoServer.java:67)
    at org.apache.nutch.mapred.JobTracker.<init>(JobTracker.java:241)
    at org.apache.nutch.mapred.JobTracker.startTracker(JobTracker.java:45)
    at org.apache.nutch.mapred.JobTracker.main(JobTracker.java:1070)
Trying to figure out what this means now.
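In case it helps anyone else hitting the same "Connection refused": below is
a rough sketch of a standalone check that separates the two failure modes
discussed in this thread, i.e. the name not resolving at all versus nothing
listening on the port. It uses only the standard java.net classes, not any
Nutch API. The class name TrackerCheck, the Status enum and the 2000 ms
timeout are my own inventions for illustration, and the host and port are
whatever you have in mapred.job.tracker.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.UnknownHostException;

/** Hypothetical helper, not part of Nutch: probes one host:port pair. */
public class TrackerCheck {

    /** Possible outcomes of a probe. */
    public enum Status { UNRESOLVED, REFUSED_OR_UNREACHABLE, LISTENING }

    public static Status probe(String host, int port, int timeoutMillis) {
        InetAddress address;
        try {
            // Step 1: can this box resolve the name at all?
            address = InetAddress.getByName(host);
        } catch (UnknownHostException e) {
            return Status.UNRESOLVED;
        }
        Socket socket = new Socket();
        try {
            // Step 2: is anything accepting TCP connections on that port?
            socket.connect(new InetSocketAddress(address, port), timeoutMillis);
            return Status.LISTENING;
        } catch (IOException e) {
            // Covers "Connection refused" as well as connect timeouts.
            return Status.REFUSED_OR_UNREACHABLE;
        } finally {
            try {
                socket.close();
            } catch (IOException ignored) {
                // Nothing useful to do if close fails.
            }
        }
    }

    public static void main(String[] args) {
        if (args.length != 2) {
            System.err.println("usage: java TrackerCheck <host> <port>");
            return;
        }
        String host = args[0];
        int port = Integer.parseInt(args[1]);
        System.out.println(host + ":" + port + " -> " + probe(host, port, 2000));
    }
}
```

Run it on each tasktracker box with the jobtracker host and port from
nutch-site.xml as the two arguments. UNRESOLVED would point at the DNS
issue Stefan describes; REFUSED_OR_UNREACHABLE with a name that resolves
would suggest the daemon is not up (or not bound to that port), which
matches what I am seeing.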
On Wed, 2005-12-14 at 23:56 +0100, Stefan Groschupf wrote:
> A common problem is that boxA tells boxB "my name is
> boxA.localnetwork", but boxB cannot connect to a box with the name
> boxA.localnetwork, since the name from the outside would be something
> like boxA.companyDomain.com. So double-check that the name boxA uses
> to identify itself to other boxes (host.conf) is also set up in
> the DNS the other boxes use.
>
> HTH
> Stefan
>
>
> On 14.12.2005 at 22:49, Michael Taggart wrote:
>
> > I've followed the steps in the media-style wiki for setting up a map
> > reduce system. I am only having one strange error when I attempt to
> > start the tasktrackers. Here is my output:
> >
> > [EMAIL PROTECTED] nutch]# bin/nutch-daemon.sh start tasktracker
> > starting tasktracker, logging
> > to /usr/local/nutch/nutch-root-tasktracker-srv08.xxxxx.com.log
> > 051214 133808 parsing file:/usr/local/nutch/conf/nutch-default.xml
> > 051214 133808 parsing file:/usr/local/nutch/conf/nutch-site.xml
> > 051214 133808 Server listener on port 50050: starting
> > 051214 133808 Server handler 0 on 50050: starting
> > 051214 133808 Server handler 1 on 50050: starting
> > 051214 133808 Server listener on port 50040: starting
> > 051214 133808 Server handler 0 on 50040: starting
> > 051214 133808 Server handler 1 on 50040: starting
> > java.net.ConnectException: Connection refused
> > at java.net.PlainSocketImpl.socketConnect(Native Method)
> >
> > I have configured my nutch-site.xml as follows on each box:
> > <?xml version="1.0"?>
> > <?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>
> >
> > <!-- Put site-specific property overrides in this file. -->
> >
> > <nutch-conf>
> > <property>
> > <name>fs.default.name</name>
> > <value>srv05.xxxxx.com:50000</value>
> > <description>The name of the default file system. Either the
> > literal string "local" or a host:port for NDFS.</description>
> > </property>
> > <property>
> > <name>mapred.job.tracker</name>
> > <value>srv05.xxxxx.com:50020</value>
> > <description>The host and port that the MapReduce job tracker runs
> > at. If "local", then jobs are run in-process as a single map
> > and reduce task.
> > </description>
> > </property>
> > </nutch-conf>
> >
> > srv05 is my namenode and jobtracker. That server starts up the
> > namenode and jobtracker services just fine. Maybe I am supposed to
> > reference fs.default.name as srv08:50000 on srv08? I thought from
> > reading the media-style wiki that I need to reference my BoxA on
> > every other machine. There is no firewall on this internal network,
> > so I am wondering why I am getting a connection refused. Anyone
> > have any ideas?
> > Thanks,
> > Mike
> >
>
> ---------------------------------------------------------------
> company: http://www.media-style.com
> forum: http://www.text-mining.org
> blog: http://www.find23.net
>
>