Re: [Mira] Commented: (LUCENE-1555) Deadlock while optimize

2009-03-11 Thread pdecrem
-Original Message- From: "Michael McCandless (JIRA)" Date: Wed, 11 Mar 2009 05:56:50 To: Subject: [jira] Commented: (LUCENE-1555) Deadlock while optimize [ https://issues.apache.org/jira/browse/LUCENE-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&fo

Re: BloomFilter-s with Lucene

2009-01-30 Thread pdecrem
Well. I used 2 Broder similarity measures, and it works well. You obviously need to pick the right size bf's. Navendu Jain has a paper called using bloomfilters to refine web search results, which I think is relevant here. I talks about how remove near duplicate search results using bf's. --