Re: custom attributes in tokens

2010-11-25 Thread Jan Kurella
Hi Simon, On 25.11.2010 10:40, ext Simon Willnauer wrote: Hi Jan, On Wed, Nov 24, 2010 at 9:12 AM, wrote: Of course: We are trying to search in documents that contain text in several languages. We are also investigating other approaches*, so this is not about finding other variants. the go

DisMaxQuery calculating too high sumOfSquaredWeights?

2010-11-26 Thread Jan Kurella
Hi there, I was composing a query on my own, the way the Solr.DisMaxQueryHandler would, as I needed a different tokenizing strategy for non-whitespace-separated languages and more. I took the concept from http://www.lucidimagination.com/blog/2010/05/23/whats-a-dismax/ Assume now the following:

Re: DisMaxQuery calculating too high sumOfSquaredWeights?

2010-11-26 Thread Jan Kurella
On 26.11.2010 14:39, ext Jan Kurella wrote: Hi there, I was composing a query on my own, the way the Solr.DisMaxQueryHandler would, as I needed a different tokenizing strategy for non-whitespace-separated languages and more. I took the concept from http://www.lucidimagination.com/blog/2010/05

Re: DisMaxQuery calculating too high sumOfSquaredWeights?

2010-11-26 Thread Jan Kurella
On 26.11.2010 14:50, ext Jan Kurella wrote: On 26.11.2010 14:39, ext Jan Kurella wrote: Hi there, I was composing a query on my own, the way the Solr.DisMaxQueryHandler would, as I needed a different tokenizing strategy for non-whitespace-separated languages and more. The concept I took from

SpanOrQuery with unreasonably high scores

2010-12-07 Thread Jan Kurella
Hi, From what I understand, the SpanOrQuery is similar to a BooleanQuery with OR clauses. It can be used inside other span queries. But compared to the BooleanQuery it has a significant difference: it lacks query normalization. Thus, because the idf of all tokens in the query is simply summed up, even documen
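
To make the normalization point concrete, here is a rough Python sketch (plain arithmetic, not Lucene code). It assumes the classic Lucene TF-IDF convention in which the query norm is 1/sqrt(sumOfSquaredWeights) and each clause weight is idf times boost. The function names and the idf values are hypothetical, chosen only for illustration, and the sketch does not reproduce the actual SpanOrQuery scoring code.

```python
import math


def sum_of_squared_weights(idfs, boost=1.0):
    # Assumed classic convention: each clause contributes (idf * boost)^2.
    return sum((idf * boost) ** 2 for idf in idfs)


def query_norm(idfs, boost=1.0):
    # Assumed classic convention: 1 / sqrt(sumOfSquaredWeights).
    total = sum_of_squared_weights(idfs, boost)
    return 1.0 / math.sqrt(total) if total > 0 else 1.0


def normalized_weights(idfs, boost=1.0):
    # Clause weights after normalization; their squares sum to 1.
    norm = query_norm(idfs, boost)
    return [idf * boost * norm for idf in idfs]


def summed_idf(idfs):
    # What a weight amounts to if the idf values are only added up,
    # with no normalization step. It grows with every extra clause.
    return sum(idfs)


if __name__ == "__main__":
    idfs = [3.0, 4.0]  # hypothetical idf values for two query tokens
    print("normalized weights:", normalized_weights(idfs))
    print("summed idf:", summed_idf(idfs))
```

With the two hypothetical idf values above, the normalized weights stay below 1, while the plain sum is 7.0 and keeps growing as clauses are added. This is one way to picture why a query weight built from summed idf alone can end up much larger than a normalized one.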