Re: TermsFilter instead of "should" TermQueries

2010-05-07 Thread Erick Erickson
Well, you construct the filter by enumerating the terms you're interested in and pass it along to the relevant search. But it looks like you've figured that part out. If you're asking how can you use a Filter and still have the terms replaced by the filter contribute to scoring, you can't. But it'

Filter vs. TermQuery performance

2010-05-07 Thread Tomislav Poljak
Hi, when is it wise to replace a TermQuery with cached Filter (regarding search performance). If TermQuery is used only to filter results based on field value (it doesn't participate in scoring), is it alway wise to replace it with filter? Is it only wise if Filter is cached (wrapped in CachingWrap

Speakers and Schedule for Berlin Buzzwords 2010 - Search, Store and Scale 7th/8th 2010

2010-05-07 Thread Simon Willnauer
Hi folks, Today we proudly present the Berlin Buzzwords talks and presentations. As promised there are tracks specific to the three tags search, store and scale. We have a fantastic mixture of developers and users of open source software projects that make scaling data processing today possible.

TermsFilter instead of "should" TermQueries

2010-05-07 Thread Tomislav Poljak
Hi, in API documentation for TermsFilter: http://search-lucene.com/jd/lucene/org/apache/lucene/search/TermsFilter.html it states: 'As a filter, this is much faster than the equivalent query (a BooleanQuery with many "should" TermQueries)' I would like to replace "should" TermQueries with TermsF

Re: Trace only exactly matching terms!

2010-05-07 Thread Erick Erickson
The other approach is to use a stemmer both at index and query time. BTW, it's very easy to make a "custom" analyzer by chaining together the Tokenizer and as many filters (e.g. PorterStemFilter), essentially composing your analyzer from various pre-built Lucene parts. HTH Erick On Fri, May 7, 2

Re: Term/Phrase frequencies

2010-05-07 Thread Erick Erickson
Well, counting frequency isn't the best approach. For instance, if a field has 1,000 terms and 10 occurrences of your target, is that a better match than a field with 10 terms and 5 occurrences of your target? This kind of thing is already taken into account with Lucene scoring, you might want to

Re: Trace only exactly matching terms!

2010-05-07 Thread Anshum
Hi Manjula, Yes lucene by default would only tackle exact term matches unless you use a custom analyzer to expand the index/query. -- Anshum Gupta http://ai-cafe.blogspot.com The facts expressed here belong to everybody, the opinions to me. The distinction is yours to draw On Fri, M

Trace only exactly matching terms!

2010-05-07 Thread manjula wijewickrema
Hi, I am using Lucene 2.9.1 . I have downloaded and run the 'HelloLucene.java' class by modifing the input document and user query in various ways. Once I put the document sentenses as 'Lucene in actions' insted of 'Lucene in action', and I gave the query as 'action' and run the programme. But it