Hello, How do I check that all pages have been fetched? Is there a command or tool, that says like: these are the number of pages in the website, the number of pages fetched, pages filtered... give a report. If errors, how many and give a brief description...
I understand analyzing log and readdb with stats/dumppageurl is one option. But, it is time consuming and requires unwanted manual work. If there is a tool/command that did the above option, I could just easily parse the report for my web services. -- View this message in context: http://www.nabble.com/Nutch-Data-Testing-tf2742246.html#a7651128 Sent from the Nutch - User mailing list archive at Nabble.com.
