Re: Please Help !! Question about Query Phrase Slop (qs) in dismax

2008-11-24 Thread Chris Hostetter

: Subject: Re: Please Help !! Question about Query Phrase Slop (qs) in dismax
: 
: 
: Please help someone...i've been waiting for an answer for the last couple of
: days & no one seems to be helping out here. I did search the wiki & this

Please don't send messages like this.  

This is a volunteer community -- no one (that I know of) is paid to 
read/reply to questions on the solr-user list.  Many of us do our best to 
make sure that all user questions get addressed, but this is a fairly high 
volume list, and sometimes other things in life (work, health, 
relationships, family, etc...) make that take a little longer than we 
would like -- sometimes questions don't get answered for a few days, it's 
just the way it is, please be patient.  Sending multiple "please help, 
still no reply" type messages just adds noise to the list, and gives people 
who *do* want to help more to read, which means it takes that much longer 
to actually reply.

If you need an answer to a question in a hurry: read the archives and the 
docs, experiment, read the code (if you know java), or hire a consultant 
to help you figure it out.

In this specific case, debugQuery=true would have quickly shown you that 
your qs=5 value wasn't making its way into the parsedquery at all, 
which might have helped you understand what was happening.
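
A minimal sketch of such a request, assuming a local Solr on the example port and that dismax is selected with defType (some configs use qt=dismax instead). The host, path and helper name are hypothetical. The parsedquery entry in the debug section of the response is where the slop would appear if it were being applied.

```python
from urllib.parse import urlencode

# Hypothetical host, port and handler path; adjust for your install.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_debug_url(user_query, slop):
    """Return a dismax request URL with debugQuery turned on."""
    params = {
        "q": user_query,
        "defType": "dismax",  # assumption: your setup may need qt=dismax instead
        "qs": slop,
        "debugQuery": "true",  # asks Solr to include the debug section
    }
    return SOLR_SELECT_URL + "?" + urlencode(params)


url = build_debug_url("child custody", 5)
print(url)
```

Fetch that URL in a browser or with any HTTP client and compare the parsedquery output with and without the qs parameter.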



-Hoss



Re: Please Help !! Question about Query Phrase Slop (qs) in dismax

2008-11-23 Thread anuvenk

Thanks for the response. Well, my current ps setting works great for most
search terms. But take this typical example, 'north dakota 1031 exchange
lawyers': we don't have any relevant docs in the index. Solr is returning
an irrelevant doc just because it found 'lawyer', 'exchange', 'north' & 'dakota'
somewhere. I thought if there were a way to just not return any results when
the terms are not within close proximity, it would be great. 

Yonik Seeley wrote:
 
 On Sun, Nov 23, 2008 at 11:51 PM, anuvenk [EMAIL PROTECTED]
 wrote:
 Please help someone...i've been waiting for an answer for the last couple
 of days & no one seems to be helping out here. I did search the wiki & this
 forum for an answer. But couldn't find an answer. I know if ps is set to 5,
 words within 5 words of one another receive a boost in score. But is there
 a way to not return results that have the words in search terms more than 5
 words apart?
 
 Not with dismax.  I'm not sure why it's a problem, given that with
 enough boost you should be able to ensure that all of the results with
 a slop less than 5 appear before other results.
 Anyway, if you want to restrict results to those with a slop of 5, use
 the standard query parser with an explicit sloppy phrase query:
 
 "north dakota 1031 exchange lawyers"~5
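
 A small sketch of building that kind of query string, assuming the standard Lucene query syntax where ~N only acts as phrase slop when it follows a double-quoted phrase (after a bare term it is read as a fuzzy match on that one term). The helper name is hypothetical.

```python
from urllib.parse import urlencode


def sloppy_phrase(phrase, slop):
    """Wrap phrase in double quotes and append ~slop."""
    # Escape backslashes and embedded quotes so the phrase stays one unit.
    escaped = phrase.replace("\\", "\\\\").replace('"', '\\"')
    return '"{}"~{}'.format(escaped, slop)


q = sloppy_phrase("north dakota 1031 exchange lawyers", 5)
print(q)

# URL-encoded form, ready to append to a standard request handler URL.
print(urlencode({"q": q}))
```

 Documents whose terms cannot be matched within the given slop are not returned at all, which is the behaviour dismax does not offer.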
 
 -Yonik
 
 
 Typical example: north dakota 1031 exchange lawyers
 My first result is absolutely irrelevant. It returned a north dakota doc,
 but it had an occurrence of attorney somewhere & an occurrence of
 exchange (not related to 1031 exchange though). They were not within 5
 words of one another. My guys have been hammering me regarding this
 relevancy issue. Please help someone.

 anuvenk wrote:

 From the solr wiki, it sounded like if qs is set to 5 for example, & if
 the search term is 'child custody', only docs with 'child' & 'custody'
 within 5 words of one another would be returned in results. Is this
 correct? If so, it doesn't seem to be working for me. I see docs with
 'child' & 'custody' more than 5 words from one another (excluding stop
 words) which is resulting in bad user experience as those docs are not
 so relevant. What more could i do to improve quality in the results?


 --
 View this message in context:
 http://www.nabble.com/Please-Help-%21%21-Question-about-Query-Phrase-Slop-%28qs%29-in-dismax-tp20643003p20654906.html
 Sent from the Solr - User mailing list archive at Nabble.com.


 
 

-- 
View this message in context: 
http://www.nabble.com/Please-Help-%21%21-Question-about-Query-Phrase-Slop-%28qs%29-in-dismax-tp20643003p20655014.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Please Help !! Question about Query Phrase Slop (qs) in dismax

2008-11-23 Thread Yonik Seeley
If you boost the phrase queries by enough, you could tell when you hit
the less relevant documents by the score.
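
A rough sketch of that idea on the client side, assuming the request asked for the score field (for example fl=*,score) and that the phrase boost is large enough for phrase matches to score well above everything else. The threshold and the sample scores below are made up for illustration; a usable cutoff has to be found by experiment.

```python
def split_by_score(docs, min_score):
    """Split docs into (kept, dropped) around a score threshold."""
    kept = [d for d in docs if d["score"] >= min_score]
    dropped = [d for d in docs if d["score"] < min_score]
    return kept, dropped


# Made-up documents and scores, illustration only.
docs = [
    {"id": "a", "score": 12.4},
    {"id": "b", "score": 11.9},
    {"id": "c", "score": 0.8},
]

# Hypothetical cutoff; it depends on the boosts you configure.
kept, dropped = split_by_score(docs, min_score=5.0)
print([d["id"] for d in kept])
print([d["id"] for d in dropped])
```

Raw scores are not comparable across different queries, so a fixed cutoff like this is only a starting point.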

-Yonik

On Mon, Nov 24, 2008 at 12:07 AM, anuvenk [EMAIL PROTECTED] wrote:

 Thanks for the response. Well, my current ps setting works great for most
 search terms. But take this typical example, 'north dakota 1031 exchange
 lawyers': we don't have any relevant docs in the index. Solr is returning
 an irrelevant doc just because it found 'lawyer', 'exchange', 'north' & 'dakota'
 somewhere. I thought if there were a way to just not return any results when
 the terms are not within close proximity, it would be great.
