Hi Doug,
I copy a working index and merge the original and the old together.
Than I run the dedub over these index. Shouldn't the dedub tool
remove the duplicates in the merged index?
Thanks,
Stefan
Am 24.10.2005 um 21:25 schrieb Doug Cutting:
It works for me. It currently only deletes md5 duplicates, but url
duplicates are currently handled elsewhere in the mapred branch.
What problems did you see?
Doug
Stefan Groschupf wrote:
Hi,
what is the status of the dedub tool in the mapreduce branche.
The javadoc mentioned that the second part isn't implemented but
the indexer will take about this issue anyway.
However I tried this tool and it looks like that it does not work
correctly.
Thanks for a comment.
Stefan
-------------------------------------------------------
This SF.Net email is sponsored by the JBoss Inc.
Get Certified Today * Register for a JBoss Training Course
Free Certification Exam for All Training Attendees Through End of 2005
Visit http://www.jboss.com/services/certification for more information
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers