Add support for hash-based exact/near duplicate document handling
-----------------------------------------------------------------

                 Key: SOLR-799
                 URL: https://issues.apache.org/jira/browse/SOLR-799
             Project: Solr
          Issue Type: New Feature
          Components: update
            Reporter: Mark Miller
            Priority: Minor


Hash-based duplicate document detection is efficient and allows for blocking
duplicates as well as field collapsing. Let's put it into Solr.

http://wiki.apache.org/solr/Deduplication
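
To make the idea concrete, here is a minimal sketch of hash-based exact
duplicate blocking. It is an illustration only, not Solr code: the names
signature and DedupIndex, the choice of MD5, the light whitespace/case
normalization, and the overwrite flag are all assumptions made for this
example. Near-duplicate detection would need a fuzzy signature instead of a
plain digest and is not shown.

```python
import hashlib


def signature(doc, fields):
    """Hash the chosen fields of a document into a hex digest.

    Hypothetical helper: field selection and normalization are assumptions.
    """
    h = hashlib.md5()
    for name in fields:
        value = str(doc.get(name, ""))
        # Lowercase and collapse whitespace so trivial differences
        # do not produce a different signature.
        normalized = " ".join(value.lower().split())
        # NUL separators keep field boundaries unambiguous.
        h.update(name.encode("utf-8"))
        h.update(b"\x00")
        h.update(normalized.encode("utf-8"))
        h.update(b"\x00")
    return h.hexdigest()


class DedupIndex:
    """Toy in-memory index keyed by signature (hypothetical, for illustration)."""

    def __init__(self, fields, overwrite=False):
        self.fields = fields
        self.overwrite = overwrite
        self.by_signature = {}

    def add(self, doc):
        """Return True if the document was stored, False if it was blocked."""
        sig = signature(doc, self.fields)
        if sig in self.by_signature and not self.overwrite:
            return False  # duplicate: block it
        # Keep the signature on the stored document so that results
        # could later be grouped (collapsed) on that value.
        self.by_signature[sig] = dict(doc, signature=sig)
        return True


index = DedupIndex(fields=["title", "body"])
print(index.add({"id": 1, "title": "Hello", "body": "World"}))
print(index.add({"id": 2, "title": "hello ", "body": "world"}))
```

Storing the digest alongside the document is what would make both uses
possible in this sketch: a lookup on the signature blocks a duplicate at add
time, and grouping on the same value collapses duplicates at query time.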

