Re: [Nutch-general] number of indexed pages

Piotr Kosiorowski Fri, 29 Jul 2005 02:39:02 -0700

Hello,
First one will give you number of pages in WebDB and not all of them
are indexed.


Regards,
Piotr

On 7/29/05, Erik Hatcher <[EMAIL PROTECTED]> wrote:
> Two options:
> 
>      bin/nutch readdb crawl/db -stats
> 
> or use Luke (Google for luke lucene) to open the Lucene index.
> 
>      Erik
> 
> On Jul 28, 2005, at 9:44 PM, blackwater dev wrote:
> 
> > After I finish a crawl...what is the best way to go into my crawl
> > directory and get the number of indexed pages?
> >
> > Thanks!
> >
> >
> > -------------------------------------------------------
> > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > September
> > 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
> > Agile & Plan-Driven Development * Managing Projects & Teams *
> > Testing & QA
> > Security * Process Improvement & Measurement * http://www.sqe.com/
> > bsce5sf
> > _______________________________________________
> > Nutch-general mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/nutch-general
> >
> 
>

Re: [Nutch-general] number of indexed pages

Reply via email to