Re: Calculating Solr document score by ignoring the boost field.

2013-07-23 Thread Chris Hostetter

: Ok thanks, I just wanted the know is it possible to ignore boost value or
: not during score calculation and as you said its not.
: Now I would have to focus on nutch to fix the issue and not to send boost=0
: to Solr.

the index time bosts are encoded in field norms -- if you wnat to ignore 
them, you could either modify your schema to 'omitNOrms=true' on all 
fields *beore* indexing, or you could customize hte SImilarity 
implementation you use to be osmething custom that does not tke into 
account field norms at all.

https://cwiki.apache.org/confluence/display/solr/Other+Schema+Elements#OtherSchemaElements-Similarity

https://lucene.apache.org/core/4_3_0/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html


-Hoss


Re: Calculating Solr document score by ignoring the boost field.

2013-07-10 Thread Daniel Collins
Sorry to repeat Jacks' previous answer but x times zero is always zero :)

A index boost is just what the name suggests, a factor by which the
document score is boosted (multiplied). Since it is in an index time value,
it is stored alongside the document, so any future scoring of the document
by any query will take this value into account. If you take Solr's internal
document score and then multiply it by zero, the result is by definition
zero...

What you seem to be saying is you are passing in an index time boost (which
is incorrect but that's an issue with Nutch), but you want Solr to ignore
it, surely the correct approach then is *not* to pass it in?

Once the data is indexed, it is fixed, unless you re-index the document,
so if that data is wrong, there is nothing Solr can do about it, you have
to re-index the documents that have incorrect data. If you want to just use
TF-IDF for scoring and not use boosting, don't supply any boosting, it's
that simple.  Sorry if this sounds repetitive, but can't think of any other
way to say it.


On 10 July 2013 06:33, Tony Mullins tonymullins...@gmail.com wrote:

 Jack due to 'some' reason my nutch is returning me index time boost =0.0
 and just for a moment suppose that nutch is and will always return boost
 =0.

 Now my simple question was why Solr is showing me document's score = 0 ?
 Why is it depending upon index time boost value ? Why or how to make Solr
 to only calculate the score value on TF-IDF ?

 Regards,
 Khan


 On Tue, Jul 9, 2013 at 6:31 PM, Jack Krupansky j...@basetechnology.com
 wrote:

  Simple math: x times zero equals zero.
 
  That's why the default document boost is 1.0 - score times 1.0 equals
  score.
 
  Any particular reason you wanted to zero out the document score from the
  document level?
 
  -- Jack Krupansky
 
  -Original Message- From: Tony Mullins
  Sent: Tuesday, July 09, 2013 9:23 AM
  To: solr-user@lucene.apache.org
  Subject: Re: Calculating Solr document score by ignoring the  field.
 
 
  I am passing boost value (via nutch) and i.e boost =0.0.
  But my question is why Solr is showing me score = 0.0 when my boost
 (index
  time boost) = 0.0 ?
  Should not Solr calculate its documents score on the basis of TF-IDF ?
 And
  if not how can I make Solr to only consider TF-IDF while calculating
  document's score ?
 
  Regards,
  Khan
 
 
  On Tue, Jul 9, 2013 at 4:46 PM, Erick Erickson erickerick...@gmail.com
 **
  wrote:
 
   My guess is that you're not really passing on the boost field's value
  and getting the default. Don't quite know how I'd track that down
  though
 
  Best
  Erick
 
  On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com
  wrote:
   Greetings,
  
   I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes
   on
   its own boost field to my Solr schema
  
   field name=boost type=float stored=true indexed=false/
  
   Now due to some reason I always get boost = 0.0 and due to this my
  Solr's
   document score is also always 0.0.
  
   Is there any way in Solr that it ignores the boost field's value for
  its
   document's score calculation ?
  
   Regards,
   Khan
 
 
 



Re: Calculating Solr document score by ignoring the boost field.

2013-07-10 Thread Tony Mullins
Ok thanks, I just wanted the know is it possible to ignore boost value or
not during score calculation and as you said its not.
Now I would have to focus on nutch to fix the issue and not to send boost=0
to Solr.

Regards,
Khan


On Wed, Jul 10, 2013 at 12:14 PM, Daniel Collins danwcoll...@gmail.comwrote:

 Sorry to repeat Jacks' previous answer but x times zero is always zero :)

 A index boost is just what the name suggests, a factor by which the
 document score is boosted (multiplied). Since it is in an index time value,
 it is stored alongside the document, so any future scoring of the document
 by any query will take this value into account. If you take Solr's internal
 document score and then multiply it by zero, the result is by definition
 zero...

 What you seem to be saying is you are passing in an index time boost (which
 is incorrect but that's an issue with Nutch), but you want Solr to ignore
 it, surely the correct approach then is *not* to pass it in?

 Once the data is indexed, it is fixed, unless you re-index the document,
 so if that data is wrong, there is nothing Solr can do about it, you have
 to re-index the documents that have incorrect data. If you want to just use
 TF-IDF for scoring and not use boosting, don't supply any boosting, it's
 that simple.  Sorry if this sounds repetitive, but can't think of any other
 way to say it.


 On 10 July 2013 06:33, Tony Mullins tonymullins...@gmail.com wrote:

  Jack due to 'some' reason my nutch is returning me index time boost =0.0
  and just for a moment suppose that nutch is and will always return boost
  =0.
 
  Now my simple question was why Solr is showing me document's score = 0 ?
  Why is it depending upon index time boost value ? Why or how to make Solr
  to only calculate the score value on TF-IDF ?
 
  Regards,
  Khan
 
 
  On Tue, Jul 9, 2013 at 6:31 PM, Jack Krupansky j...@basetechnology.com
  wrote:
 
   Simple math: x times zero equals zero.
  
   That's why the default document boost is 1.0 - score times 1.0 equals
   score.
  
   Any particular reason you wanted to zero out the document score from
 the
   document level?
  
   -- Jack Krupansky
  
   -Original Message- From: Tony Mullins
   Sent: Tuesday, July 09, 2013 9:23 AM
   To: solr-user@lucene.apache.org
   Subject: Re: Calculating Solr document score by ignoring the  field.
  
  
   I am passing boost value (via nutch) and i.e boost =0.0.
   But my question is why Solr is showing me score = 0.0 when my boost
  (index
   time boost) = 0.0 ?
   Should not Solr calculate its documents score on the basis of TF-IDF ?
  And
   if not how can I make Solr to only consider TF-IDF while calculating
   document's score ?
  
   Regards,
   Khan
  
  
   On Tue, Jul 9, 2013 at 4:46 PM, Erick Erickson 
 erickerick...@gmail.com
  **
   wrote:
  
My guess is that you're not really passing on the boost field's value
   and getting the default. Don't quite know how I'd track that down
   though
  
   Best
   Erick
  
   On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com
   wrote:
Greetings,
   
I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch
 passes
on
its own boost field to my Solr schema
   
field name=boost type=float stored=true indexed=false/
   
Now due to some reason I always get boost = 0.0 and due to this my
   Solr's
document score is also always 0.0.
   
Is there any way in Solr that it ignores the boost field's value
 for
   its
document's score calculation ?
   
Regards,
Khan
  
  
  
 



Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread imran khan
Greetings,

I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes on
its own boost field to my Solr schema

field name=boost type=float stored=true indexed=false/

Now due to some reason I always get boost = 0.0 and due to this my Solr's
document score is also always 0.0.

Is there any way in Solr that it ignores the boost field's value for its
document's score calculation ?

Regards,
Khan


Re: Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread Erick Erickson
My guess is that you're not really passing on the boost field's value
and getting the default. Don't quite know how I'd track that down though

Best
Erick

On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com wrote:
 Greetings,

 I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes on
 its own boost field to my Solr schema

 field name=boost type=float stored=true indexed=false/

 Now due to some reason I always get boost = 0.0 and due to this my Solr's
 document score is also always 0.0.

 Is there any way in Solr that it ignores the boost field's value for its
 document's score calculation ?

 Regards,
 Khan


Re: Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread Tony Mullins
I am passing boost value (via nutch) and i.e boost =0.0.
But my question is why Solr is showing me score = 0.0 when my boost (index
time boost) = 0.0 ?
Should not Solr calculate its documents score on the basis of TF-IDF ? And
if not how can I make Solr to only consider TF-IDF while calculating
document's score ?

Regards,
Khan


On Tue, Jul 9, 2013 at 4:46 PM, Erick Erickson erickerick...@gmail.comwrote:

 My guess is that you're not really passing on the boost field's value
 and getting the default. Don't quite know how I'd track that down
 though

 Best
 Erick

 On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com
 wrote:
  Greetings,
 
  I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes on
  its own boost field to my Solr schema
 
  field name=boost type=float stored=true indexed=false/
 
  Now due to some reason I always get boost = 0.0 and due to this my
 Solr's
  document score is also always 0.0.
 
  Is there any way in Solr that it ignores the boost field's value for
 its
  document's score calculation ?
 
  Regards,
  Khan



Re: Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread Jack Krupansky

Simple math: x times zero equals zero.

That's why the default document boost is 1.0 - score times 1.0 equals score.

Any particular reason you wanted to zero out the document score from the 
document level?


-- Jack Krupansky

-Original Message- 
From: Tony Mullins

Sent: Tuesday, July 09, 2013 9:23 AM
To: solr-user@lucene.apache.org
Subject: Re: Calculating Solr document score by ignoring the  field.

I am passing boost value (via nutch) and i.e boost =0.0.
But my question is why Solr is showing me score = 0.0 when my boost (index
time boost) = 0.0 ?
Should not Solr calculate its documents score on the basis of TF-IDF ? And
if not how can I make Solr to only consider TF-IDF while calculating
document's score ?

Regards,
Khan


On Tue, Jul 9, 2013 at 4:46 PM, Erick Erickson 
erickerick...@gmail.comwrote:



My guess is that you're not really passing on the boost field's value
and getting the default. Don't quite know how I'd track that down
though

Best
Erick

On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com
wrote:
 Greetings,

 I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes 
 on

 its own boost field to my Solr schema

 field name=boost type=float stored=true indexed=false/

 Now due to some reason I always get boost = 0.0 and due to this my
Solr's
 document score is also always 0.0.

 Is there any way in Solr that it ignores the boost field's value for
its
 document's score calculation ?

 Regards,
 Khan





Re: Calculating Solr document score by ignoring the boost field.

2013-07-09 Thread Tony Mullins
Jack due to 'some' reason my nutch is returning me index time boost =0.0
and just for a moment suppose that nutch is and will always return boost =0.

Now my simple question was why Solr is showing me document's score = 0 ?
Why is it depending upon index time boost value ? Why or how to make Solr
to only calculate the score value on TF-IDF ?

Regards,
Khan


On Tue, Jul 9, 2013 at 6:31 PM, Jack Krupansky j...@basetechnology.comwrote:

 Simple math: x times zero equals zero.

 That's why the default document boost is 1.0 - score times 1.0 equals
 score.

 Any particular reason you wanted to zero out the document score from the
 document level?

 -- Jack Krupansky

 -Original Message- From: Tony Mullins
 Sent: Tuesday, July 09, 2013 9:23 AM
 To: solr-user@lucene.apache.org
 Subject: Re: Calculating Solr document score by ignoring the  field.


 I am passing boost value (via nutch) and i.e boost =0.0.
 But my question is why Solr is showing me score = 0.0 when my boost (index
 time boost) = 0.0 ?
 Should not Solr calculate its documents score on the basis of TF-IDF ? And
 if not how can I make Solr to only consider TF-IDF while calculating
 document's score ?

 Regards,
 Khan


 On Tue, Jul 9, 2013 at 4:46 PM, Erick Erickson erickerick...@gmail.com**
 wrote:

  My guess is that you're not really passing on the boost field's value
 and getting the default. Don't quite know how I'd track that down
 though

 Best
 Erick

 On Tue, Jul 9, 2013 at 4:09 AM, imran khan imrankhan.x...@gmail.com
 wrote:
  Greetings,
 
  I am using nutch 2.x as my datasource for Solr 4.3.0. And nutch passes
  on
  its own boost field to my Solr schema
 
  field name=boost type=float stored=true indexed=false/
 
  Now due to some reason I always get boost = 0.0 and due to this my
 Solr's
  document score is also always 0.0.
 
  Is there any way in Solr that it ignores the boost field's value for
 its
  document's score calculation ?
 
  Regards,
  Khan