Ben Halsted wrote:
I've modified the auto-crawl to always use a pre-existing crawldb. If I run
it multiple times I get multiple linkdb, segments, indexes, and index
directories.
Is it possible to merge the results using the bin/nutch commands?
You should also have it use a single linkdb. Then use 'bin/nutch dedup'
and 'bin/nutch merge' across both indexes directories to create a new
index with everything.
Doug
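
As a rough illustration of the two steps Doug describes, the sequence might look like the sketch below. The directory names (crawl-a, crawl-b, merged-index) and the merge_crawls helper are hypothetical, and the argument order is an assumption based on the usage strings of Nutch releases from around that time (dedup takes the indexes directories; merge takes the output index followed by the indexes directories). Check the usage message printed by your own bin/nutch before relying on it. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch only. Assumed usage (verify against your Nutch version):
#   bin/nutch dedup <indexes> ...
#   bin/nutch merge <outputIndex> <indexesDir> ...
# Directory names below are hypothetical placeholders.

# Dry run by default: commands are printed, not executed.
# Set NUTCH=bin/nutch in the environment to actually run them.
NUTCH="${NUTCH:-echo bin/nutch}"

merge_crawls() {
  out="$1"   # name of the new, combined index
  shift      # remaining arguments: one indexes directory per crawl run

  # Step 1: delete duplicate documents across all indexes directories.
  $NUTCH dedup "$@"

  # Step 2: merge the deduplicated indexes into a single new index.
  $NUTCH merge "$out" "$@"
}

merge_crawls merged-index crawl-a/indexes crawl-b/indexes
```

Running dedup before merge follows the order given in the reply, so duplicates are marked as deleted before the indexes are combined.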
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general