[ 
http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423557 ] 
            
Sami Siren commented on NUTCH-318:
----------------------------------

could this be solved by just adding folowing line into conf/log4j.properties?

log4j.logger.org.apache.nutch.crawl.CrawlDbReader=INFO,stdout

for me it produces following output to stdout:
bin/nutch readdb ../nutch-0.8-release/crawl/crawldb -stats
2006-07-26 10:09:28,839 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(210)) - CrawlDb statistics start: 
../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,203 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(261)) - Statistics for CrawlDb: 
../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,204 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(277)) - TOTAL urls:      60
2006-07-26 10:09:31,206 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(272)) - avg score: 1.015
2006-07-26 10:09:31,206 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(270)) - max score: 1.103
2006-07-26 10:09:31,208 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(268)) - min score: 1.012
2006-07-26 10:09:31,209 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(277)) - retry 0:   60
2006-07-26 10:09:31,209 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(276)) - status 1 (DB_unfetched):  59
2006-07-26 10:09:31,211 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(276)) - status 2 (DB_fetched):    1
2006-07-26 10:09:31,212 INFO  crawl.CrawlDbReader 
(CrawlDbReader.java:processStatJob(282)) - CrawlDb statistics: done

of course it would look more nice if we create another format for such cases 
also (perhaps remove some unneccessary info)


> log4j not proper configured, readdb doesnt give any information
> ---------------------------------------------------------------
>
>                 Key: NUTCH-318
>                 URL: http://issues.apache.org/jira/browse/NUTCH-318
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 0.8-dev
>            Reporter: Stefan Groschupf
>            Priority: Critical
>             Fix For: 0.9-dev
>
>
> In the latest .8 sources the readdb command doesn't dump any information 
> anymore. 
> This is realeated to the miss configured log4j.properties file. 
> changing:
> log4j.rootLogger=INFO,DRFA
> to:
> log4j.rootLogger=INFO,DRFA,stdout
> dumps the information to the console, but not in a nice way. 
> What makes me wonder  is that these information should be also in the log 
> file, but the arn't, so there are may be even here problems.
> Also what is the different between hadoop-XXX-jobtracker-XXX.out and 
> hadoop-XXX-jobtracker-XXX.log ?? Shouldn't there just one of them?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to