Does -topN only grab the urls of the top x amount of
pages that you specify as per building your next
segment?

If so, would it not make sense to create a -batch or
something where you can specify to grab x amount of
urls that have yet to be fetched?

It seems usig -topN over and over my db grows
accordingly however the amount of urls actually
fetched doesn't equal the count of the DB total urls.

??
-byron


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to