Hi,

I am using Nutch 1.0.

For simple excercise i have crawled one single domain and after that i
tried both command readdb and readseg...
Both showing different figures. Which one i should consider? does
something went wrong while crawling?

Here is the output of both command.

OUTPUT FROM READDB:
----------------------------------------
CrawlDb statistics start: crawled/crawldb
Statistics for CrawlDb: crawled/crawldb
TOTAL urls:     84178
retry 0:        84175
retry 1:        3
min score:      0.0
avg score:      7.1693314E-5
max score:      1.2
status 1 (db_unfetched):        80475
status 2 (db_fetched):  3634
status 3 (db_gone):     8
status 4 (db_redir_temp):       29
status 5 (db_redir_perm):       32
CrawlDb statistics: done


OUTPUT FROM READSEG:
-------------------------------------------
NAME            GENERATED       FETCHER START           FETCHER END
         FETCHED PARSED
20091212212627  1               2009-12-12T21:28:29
2009-12-12T21:28:29     1       1
20091212212951  81              2009-12-12T21:32:20
2009-12-12T21:32:54     105     80
20091212213347  3691            2009-12-12T21:36:13
2009-12-12T22:16:39     3738    3621
20091212222210  84178           2009-12-12T22:24:30
2009-12-13T11:08:28     85189   81806
20091213151344  84178           2009-12-13T15:16:37
2009-12-14T05:50:45     85195   81824


Thanks.
Bhavin

Reply via email to