Re: Minimum score filter

2011-08-16 Thread Erick Erickson
Right, so you're using edismax? This is expected. You can do
a number of things:

1 change the parameters of edismax
2 have your app filter out returns that dive beneath some threshold
 that is relative to the score of the first doc in the list.

But I don't see why, given your example, it makes a difference. Because
you got the document you were looking for in this instance says nothing
abou the next time when you search on something different

Best
Erick

On Mon, Aug 15, 2011 at 2:22 PM, Donald J. Organ IV
dor...@donaldorgan.com wrote:
 OK I am doing a search using the following fields     name^2.0 code^1.8 
 cat_search^1.5 description^0.8

 I am searching for:   free range dog nips

 I am getting back 2 documents the first is the document I am looking for, and 
 contains those works in the name field, as the name field is Free Range Dog 
 Nip Chicken Breast Wraps


 The second looks like its matching because those words are contained within 
 the description.



 - Original Message -
 From: simon mtnes...@gmail.com
 To: solr-user@lucene.apache.org
 Sent: Monday, August 15, 2011 1:59:17 PM
 Subject: Re: Minimum score filter

 The absolute value of a relevance score doesn't have a lot of meaning and
 the range of scores can vary a lot depending on any boost you may apply.
 Even if you normalize them (say on a 1-100 scale where 100 is the max
 relevance) you can't really draw any valid conclusions from those values.

 It would help if you described exactly what problem you're trying to solve.

 -Simon

 On Mon, Aug 15, 2011 at 1:02 PM, Donald J. Organ IV



Minimum score filter

2011-08-15 Thread Donald J. Organ IV
Is there a way to set a minimum score requirement so that matches below a given 
score are not return/included in facet counts. 

Re: Minimum score filter

2011-08-15 Thread simon
The absolute value of a relevance score doesn't have a lot of meaning and
the range of scores can vary a lot depending on any boost you may apply.
Even if you normalize them (say on a 1-100 scale where 100 is the max
relevance) you can't really draw any valid conclusions from those values.

It would help if you described exactly what problem you're trying to solve.

-Simon

On Mon, Aug 15, 2011 at 1:02 PM, Donald J. Organ IV
dor...@donaldorgan.comwrote:

 Is there a way to set a minimum score requirement so that matches below a
 given score are not return/included in facet counts.


Re: Minimum score filter

2011-08-15 Thread Donald J. Organ IV
OK I am doing a search using the following fields name^2.0 code^1.8 
cat_search^1.5 description^0.8

I am searching for:   free range dog nips

I am getting back 2 documents the first is the document I am looking for, and 
contains those works in the name field, as the name field is Free Range Dog 
Nip Chicken Breast Wraps


The second looks like its matching because those words are contained within the 
description.



- Original Message -
From: simon mtnes...@gmail.com
To: solr-user@lucene.apache.org
Sent: Monday, August 15, 2011 1:59:17 PM
Subject: Re: Minimum score filter

The absolute value of a relevance score doesn't have a lot of meaning and
the range of scores can vary a lot depending on any boost you may apply.
Even if you normalize them (say on a 1-100 scale where 100 is the max
relevance) you can't really draw any valid conclusions from those values.

It would help if you described exactly what problem you're trying to solve.

-Simon

On Mon, Aug 15, 2011 at 1:02 PM, Donald J. Organ IV


Re: Minimum Score

2011-08-05 Thread Kissue Kissue
But that would mean returning all the results without pagination which i
dont want to do. I am looking for a way to do it without having to return
all the results at once.

Thanks.

On Thu, Aug 4, 2011 at 11:18 PM, Darren Govoni dar...@ontrenet.com wrote:

 Off the top of my head you maybe you can get the number of results and
 then
 look at the last document and check its score. I believe the results will
 be ordered by score?


 On 08/04/2011 05:44 PM, Kissue Kissue wrote:

 Hi,

 I am using Solr 3.1 with the SolrJ client library. I can see that it is
 possible to get the maximum score for your search by using the following:

 response.getResults().**getMaxScore()

 I am wondering is there some simple solution to get the minimum score?

 Many thanks.





Re: Minimum Score

2011-08-05 Thread Martijn v Groningen
As far as I know there is no built-in solution for this like there is for
max score.
An alternative approach to the one already mentioned is to send a second
request with rows=1 and sort=score asc
This will return the lowest scoring document and you can then retrieve the
score from that document (if fl=*, score).

Martijn

On 5 August 2011 10:45, Kissue Kissue kissue...@gmail.com wrote:

 But that would mean returning all the results without pagination which i
 dont want to do. I am looking for a way to do it without having to return
 all the results at once.

 Thanks.

 On Thu, Aug 4, 2011 at 11:18 PM, Darren Govoni dar...@ontrenet.com
 wrote:

  Off the top of my head you maybe you can get the number of results and
  then
  look at the last document and check its score. I believe the results will
  be ordered by score?
 
 
  On 08/04/2011 05:44 PM, Kissue Kissue wrote:
 
  Hi,
 
  I am using Solr 3.1 with the SolrJ client library. I can see that it is
  possible to get the maximum score for your search by using the
 following:
 
  response.getResults().**getMaxScore()
 
  I am wondering is there some simple solution to get the minimum score?
 
  Many thanks.
 
 
 




-- 
Met vriendelijke groet,

Martijn van Groningen


Re: Minimum Score

2011-08-04 Thread Kissue Kissue
Hi,

I am using Solr 3.1 with the SolrJ client library. I can see that it is
possible to get the maximum score for your search by using the following:

response.getResults().getMaxScore()

I am wondering is there some simple solution to get the minimum score?

Many thanks.


Re: Minimum Score

2011-08-04 Thread Darren Govoni
Off the top of my head you maybe you can get the number of results and 
then
look at the last document and check its score. I believe the results 
will be ordered by score?


On 08/04/2011 05:44 PM, Kissue Kissue wrote:

Hi,

I am using Solr 3.1 with the SolrJ client library. I can see that it is
possible to get the maximum score for your search by using the following:

response.getResults().getMaxScore()

I am wondering is there some simple solution to get the minimum score?

Many thanks.





Re: Possible to set minimum score/relevance?

2006-10-17 Thread bo_b


Chris Hostetter wrote:
 
 :  But searches in the vbulletin database returns an awful lots of
 hits(like
 :  100.000+ for some queries), even though perhaps only the first handful
 seem
 :  relevant. So ideally we would like the link to say Found 12 extra
 results
 :  in vbulletin, if the first 12 results had a high score, and result 13
 to
 :  100.000 had a low score.
 
 the reasl question is, are ou just going to display that text, or is it
 going to be a link to the actual search: if you've going to give the user
 a link, then you're going to want to make sure the page they get to
 matches up with their expecation from the link text, so saying there are
 only 12 results when there are really 100.000 is going to be a bold faced
 lie -- what you should do is re-evaluate your query structure so that you
 only get the really good results (the 12) and have optional UI elements
 allowing people to relax the search criteria to get the full 100K.
 
 what criteria you should use to keep the results set small really depends
 on how you define good results vs bad results.
 
 -Hoss
 

There will be a link to the actual search, and I agree the number of results
on the result page needs to be the same as the text in the link says.

But anyway we just discovered that using the minimum match feature of the
dismax request handler allows us to narrow the amount of search results down
quite a bit. 

Because we take a users query and expand it semantically through an external
component(a bit like the synonym.txt file, but with weights assigned to each
of the synonyms) before feeding it to solr, so a 2 word query might end up
being a 10 word query, and this cause a huge increase in results.

Using the minium match feature seems to work out well for bringing it back
down to realistic levels.

Thanks,
Bo
-- 
View this message in context: 
http://www.nabble.com/Possible-to-set-minimum-score-relevance--tf2451869.html#a6855044
Sent from the Solr - User mailing list archive at Nabble.com.



Possible to set minimum score/relevance?

2006-10-16 Thread bo_b

Hello,

I was wondering if it was possible to set a minimum score/relevance for
search results? And how is the score calculated anyway? I thought i read
somewhere that lucene scores were normalized between 0..1, but that doesnt
seem to be the case for solr?

In our case we have indexed a 7 million posts vbulletin database. On a
search page we have, we would like to be able to have a sidebar which
includes a link to our vbulletin search that says Found  extra results
in vbulletin. 

But searches in the vbulletin database returns an awful lots of hits(like
100.000+ for some queries), even though perhaps only the first handful seem
relevant. So ideally we would like the link to say Found 12 extra results
in vbulletin, if the first 12 results had a high score, and result 13 to
100.000 had a low score.

Best regards,
Bo
-- 
View this message in context: 
http://www.nabble.com/Possible-to-set-minimum-score-relevance--tf2451869.html#a6833454
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Possible to set minimum score/relevance?

2006-10-16 Thread Yonik Seeley

On 10/16/06, bo_b [EMAIL PROTECTED] wrote:

I was wondering if it was possible to set a minimum score/relevance for
search results? And how is the score calculated anyway?


http://lucene.apache.org/java/docs/scoring.html

Making an arbitrary cuttoff mean something would be quite difficult.


I thought i read
somewhere that lucene scores were normalized between 0..1, but that doesnt
seem to be the case for solr?


Solr never normalizes scores since it may be easily done by the client
- the maxScore is given in the results, so just divide all scores by
maxScore.  If Solr normalized scores, information would be thrown away
and clients wouldn't be able to un-normalize if needed.


In our case we have indexed a 7 million posts vbulletin database. On a
search page we have, we would like to be able to have a sidebar which
includes a link to our vbulletin search that says Found  extra results
in vbulletin.

But searches in the vbulletin database returns an awful lots of hits(like
100.000+ for some queries), even though perhaps only the first handful seem
relevant. So ideally we would like the link to say Found 12 extra results
in vbulletin, if the first 12 results had a high score, and result 13 to
100.000 had a low score.


You could try to analyze the scores yourself and see if there is a
natural break.

-Yonik