I have the same question. I know the log contains this information, but
how can I watch what is happening in real time, as in nutch 0.7.2 when
you run the crawl command in the terminal?
----- Original Message -----
From: "Ben Ogle" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Wednesday, September 13, 2006 5:59 PM
Subject: Re: 0.8 Intranet Crawl Output/Logging?
Look in the hadoop.log file under the nutch-0.8/logs dir. It should have
that info.
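
To watch that file while the crawl is running, something along these
lines may help. It is only a rough sketch that mimics `tail -f` in
Python; the function name follow() and its parameters are made up for
illustration and are not part of nutch, and the path logs/hadoop.log
assumes the script is started from the nutch-0.8 directory.

```python
import time


def follow(path, poll_seconds=1.0, max_idle_polls=None, from_start=False):
    """Yield complete lines appended to `path`, roughly like `tail -f`.

    max_idle_polls=None keeps waiting forever; a number stops the loop
    after that many consecutive polls that found no new data.
    """
    with open(path, "r", errors="replace") as log:
        if not from_start:
            log.seek(0, 2)  # jump to the end so only new entries are shown
        pending = ""  # holds a line the writer has not finished yet
        idle = 0
        while max_idle_polls is None or idle < max_idle_polls:
            chunk = log.readline()
            if chunk:
                idle = 0
                pending += chunk
                if pending.endswith("\n"):
                    yield pending.rstrip("\n")
                    pending = ""
            else:
                idle += 1
                time.sleep(poll_seconds)


# Example use (assumed path, runs until interrupted with Ctrl-C):
#
#     for entry in follow("logs/hadoop.log"):
#         print(entry)
```

On a Unix-like system, running `tail -f logs/hadoop.log` in a second
terminal should give much the same result without any script.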
Ben
jared.dunne wrote:
I am using the nutch 0.8 'crawl' command to crawl some content. When I
run the crawl command, I don't see any output, but the crawl is
running... Is there a way to see information about what the crawler is
doing?
I have tried setting 'fetcher.verbose' to 'true' in my nutch-site.xml,
but that did not change the behaviour.
I am trying to enable some plugins (file protocol and parse-xml plugin)
but I can't tell if they are being loaded correctly without some output
from nutch.
Thanks!
Jared-