On Wed, Jul 7, 2010 at 11:49 PM, Grant Ingersoll <[email protected]>wrote:
> How do you want to determine copy? Strictly or loosely? Solr and Nutch > have some deduplication capabilities, including fuzzy matching. They > probably could be brought into Mahout, too. > > -Grant > > > Dear Grant I am trying to make a strict match. I will try Solar and Nutch. Thanks and Regards -- ********************************** JAGANADH G http://jaganadhg.freeflux.net/blog
