Hello,

How do I check that all pages have been fetched? Is there a command or tool,
that says like:
these are the number of pages in the website, the number of pages fetched,
pages filtered...
give a report. If errors, how many and give a brief description...

I understand analyzing log and readdb with stats/dumppageurl is one option.
But, it is time consuming and requires unwanted manual work. If there is a
tool/command that did the above option, I could just easily parse the report
for my web services.
-- 
View this message in context: 
http://www.nabble.com/Nutch-Data-Testing-tf2742246.html#a7651128
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to