Dear List,
I counted the pages in the segments:
bin/nutch segread -fix -list -dir segments
the sum of results is: 11 million pages - 'dedup' removes 2 million = 9
million pages.
When I search in the frontend with "http" the result is 6 million, how to
find the missing 3 million pages?
How to count the total number of searchable pages in the search
server?
Best Regards,
Ferenc
-------------------------------------------------------
This SF.Net email is sponsored by Oracle Space Sweepstakes
Want to be the first software developer in space?
Enter now for the Oracle Space Sweepstakes!
http://ads.osdn.com/?ad_id=7393&alloc_id=16281&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general