Kuro,
Thanks for the tip. I made the changes you suggested and looked at the
debug output, which showed me that my difficulties were actually
occurring before I got the "Job Failed" exception. Specifically, when I call
the bin/nutch inject command at the beginning of my "whole-web" crawl, I
get the following error, which I haven't been able to figure out (any
insights are much appreciated):
2006-06-14 18:02:32,401 DEBUG conf.Configuration(Configuration.java:<init>(67))- java.io.IOException: config()
    at org.apache.hadoop.conf.Configuration.<init>(Configuration.java:67)
    at org.apache.nutch.util.NutchConfiguration.create(NutchConfiguration.java:50)
    at org.apache.nutch.crawl.Injector.main(Injector.java:148)
This seemed like a separate issue, so I posted about it separately
yesterday:
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200606.mbox/[EMAIL PROTECTED]
Thanks,
Bryan
On 6/16/06, Teruhiko Kurosaka <[EMAIL PROTECTED]> wrote:
Bryan,
Some recent changes in the logging code changed the default logging
behavior; nutch no longer outputs anything to the console. (It supposedly
sends the logging output to a file described as
${nutch.log.dir}/${nutch.log.file}, but I don't know what the default
values of these variables are.)
You can edit conf/log4j.properties to change the logging behavior of
the "nutch" command line. (There is a separate logging properties file for
the search GUI.) I changed conf/log4j.properties as outlined below to enable
full debug logging. (Only the changed lines are shown.)
#log4j.rootLogger=INFO,DRFA
log4j.rootLogger=DEBUG, stdout
#log4j.logger.org.apache.nutch=INFO
#log4j.logger.org.apache.hadoop=WARN
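Note that the rootLogger line above refers to an appender named "stdout".
If your copy of conf/log4j.properties does not already define one (this is
an assumption about the file, not something checked against it), a minimal
console appender in standard log4j 1.x syntax would look roughly like the
sketch below. The pattern string is only an illustration and can be
changed freely.

```properties
# Hypothetical appender definition; only needed if "stdout" is not
# already defined elsewhere in conf/log4j.properties.
log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
# Date, level, short logger name, then the message.
log4j.appender.stdout.layout.ConversionPattern=%d{ISO8601} %-5p %c{2} - %m%n
```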
I hope this helps.
-kuro
> From: Bryan Woliner [mailto:[EMAIL PROTECTED]
> Sent: 2006-6-15 18:21
> $ bin/nutch crawl test -dir crawl3 -depth 2 -topN 50
>
> It seemed like everything worked correctly (although unlike nutch 0.7.1, no
> output was generated)