stop words make dismax fail

2010-02-24 Thread Nagelberg, Kallin
I'm having a problem when users enter stopwords in their query. I'm using a 
dismax request handler against a field setup  like:

fieldType name=simpleText class=solr.TextField positionIncrementGap=100
  analyzer type=index
tokenizer class=solr.StandardTokenizerFactory/
filter class=solr.StopFilterFactory 
ignoreCase=true words=stopwords.txt enablePositionIncrements=true /
filter class=solr.LowerCaseFilterFactory/
filter class=solr.LengthFilterFactory min=2 
max=20 /
  /analyzer
  analyzer type=query
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.StopFilterFactory 
ignoreCase=true words=stopwords.txt enablePositionIncrements=true /
filter class=solr.SynonymFilterFactory 
synonyms=synonyms.txt ignoreCase=true expand=true/
filter class=solr.LowerCaseFilterFactory/
filter class=solr.LengthFilterFactory min=2 
max=20 /
  /analyzer
/fieldType



The problem is that when a user enters a query like 'meet the president', zero 
results are returned. I imagine it has something to do with 'the' being 
stripped out, then only 2 of the 3 terms matching. As a temporary workaround I 
set minshouldmatch to 1 so I do get results. That causes other problems though, 
such as 'the' never being highlighted in the results. Am I doing something 
totally wrong?

Thanks,
Kallin Nagelberg


Re: stop words make dismax fail

2010-02-24 Thread Ahmet Arslan
 The problem is that when a user enters a query like 'meet
 the president', zero results are returned. I imagine it has
 something to do with 'the' being stripped out, then only 2
 of the 3 terms matching. As a temporary workaround I set
 minshouldmatch to 1 so I do get results. That causes other
 problems though, such as 'the' never being highlighted in
 the results. Am I doing something totally wrong?


Recent discussion about this:
http://www.mail-archive.com/solr-user@lucene.apache.org/msg32445.html