Hi all,
I have written a segment merging tool. It removes duplicated or otherwise unused content from several segments and joins them into a single new segment. The tool can also optionally perform several other steps required for proper operation of Nutch: indexing the input segments, deleting duplicates, merging indices, and indexing the new single segment.
The file has been submitted here:
http://sourceforge.net/tracker/index.php?func=detail&aid=986775&group_id=59548&atid=491356
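
For readers who have not looked at the patch, the core idea can be pictured roughly as follows. This is only an illustrative sketch in plain Java, not the code of the submitted tool: the Page class, its fields, and the rule "keep the most recently fetched copy of each URL and skip empty entries" are assumptions made up for the example, and no Nutch classes are used.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SegmentMergeSketch {

    /** Hypothetical stand-in for one fetched page stored in a segment. */
    static final class Page {
        final String url;
        final long fetchTime;
        final String content;

        Page(String url, long fetchTime, String content) {
            this.url = url;
            this.fetchTime = fetchTime;
            this.content = content;
        }
    }

    /**
     * Merges several segments into one list, keeping a single entry per URL
     * (the most recently fetched one) and skipping entries with no content.
     */
    static List<Page> merge(List<List<Page>> segments) {
        Map<String, Page> newest = new LinkedHashMap<>();
        for (List<Page> segment : segments) {
            for (Page page : segment) {
                if (page.content == null || page.content.isEmpty()) {
                    continue; // assumption: empty entries count as unused content
                }
                Page seen = newest.get(page.url);
                if (seen == null || page.fetchTime > seen.fetchTime) {
                    newest.put(page.url, page); // newer copy replaces the older one
                }
            }
        }
        return new ArrayList<>(newest.values());
    }

    public static void main(String[] args) {
        List<Page> first = Arrays.asList(
                new Page("http://a/", 1L, "old"),
                new Page("http://b/", 1L, "b"));
        List<Page> second = Arrays.asList(
                new Page("http://a/", 2L, "new"),
                new Page("http://c/", 2L, ""));
        for (Page p : merge(Arrays.asList(first, second))) {
            System.out.println(p.url + " " + p.content);
        }
    }
}
```

In this toy run the two copies of http://a/ collapse into the newer one and the empty http://c/ entry is dropped. The real tool works on Nutch segment data and can also run the indexing and duplicate-deletion steps mentioned above.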
--
Best regards,
Andrzej Bialecki

-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
