Kuro,

Thanks for the tip. I made the changes you suggested and took a look at the
debug output, which allowed me to realize that my difficulties were actually
occurring before I got the "Job Failed" exception. Specifically, when I call
the bin/nutch inject command at the beginning of the my "whole-web" crawl, I
get the following error, which I haven't been able to figure out (any
insights are much apprecaited):

2006-06-14 18:02:32,401 DEBUG
conf.Configuration(Configuration.java:<init>(67))-
java.io.IOException: config()
       at org.apache.hadoop.conf.Configuration.<init>(Configuration.java:67)
        at 
org.apache.nutch.util.NutchConfiguration.create(NutchConfiguration.java:50)
        at org.apache.nutch.crawl.Injector.main(Injector.java:148)

This problem seemed like a different issue so I posted a separate post about
it  yesterday:
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200606.mbox/[EMAIL 
PROTECTED]

Thanks,
Bryan


On 6/16/06, Teruhiko Kurosaka <[EMAIL PROTECTED]> wrote:

Bryan,
Some recent changes in the logging code changed the default logging
behavior;
nutch doesn't output anything to the console. (It supposedly sends the
logging
output to a file described as ${nutch.log.dir}/${nutch.log.file} but I
don't know
what the default values of these variables.)

You can change conf/log4j.propertiies to change the logging behavior of
the "nutch" command line.  (There is another logging properties for
search
GUI.)  I changed conf/log4j.properties like outlined below, to enable
full debug
logging. (Only changed lines are shown).
#log4j.rootLogger=INFO,DRFA
log4j.rootLogger=DEBUG, stdout
#log4j.logger.org.apache.nutch=INFO
#log4j.logger.org.apache.hadoop=WARN

I hope this helps.
-kuro

> From: Bryan Woliner [mailto:[EMAIL PROTECTED]
> Sent: 2006-6-15 18:21

> $ bin/nutch crawl test -dir crawl3 -depth 2 -topN 50
>
> It seemed like everything worked correctly (although unlike nutch
0.7.1, no
> ouput was generated)


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to