-Original Message-
From: "Michael McCandless (JIRA)"
Date: Wed, 11 Mar 2009 05:56:50
To:
Subject: [jira] Commented: (LUCENE-1555) Deadlock while optimize
[
https://issues.apache.org/jira/browse/LUCENE-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&fo
Well. I used 2 Broder similarity measures, and it works well. You obviously
need to pick the right size bf's.
Navendu Jain has a paper called using bloomfilters to refine web search
results, which I think is relevant here. I talks about how remove near
duplicate search results using bf's.
--