Two options:
bin/nutch readdb crawl/db -stats
or use Luke (Google for luke lucene) to open the Lucene index.
Erik
On Jul 28, 2005, at 9:44 PM, blackwater dev wrote:
After I finish a crawl...what is the best way to go into my crawl
directory and get the number of indexed pages?
Hello,
First one will give you number of pages in WebDB and not all of them
are indexed.
Regards,
Piotr
On 7/29/05, Erik Hatcher [EMAIL PROTECTED] wrote:
Two options:
bin/nutch readdb crawl/db -stats
or use Luke (Google for luke lucene) to open the Lucene index.
Erik
On