Doug,

(1) No, I did *not* boost the pub_date field, either in the indexing process
or in the query itself.

(2) And, each pub_date field of each document (which is in XML format)
contains only one instance of the date string.

(3) And only the pub_date field itself is indexed.  There are other
attributes of this field that may contain the date string, but they aren't
indexed - that is, they are not included in the instantiated Document class.

Regards,

Terry

----- Original Message -----
From: "Doug Cutting" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, September 17, 2003 5:51 PM
Subject: Re: Lucene Scoring Behavior


> Terry Steichen wrote:
> >   0.03125 = fieldNorm(field=pub_date, doc=90992)
> >   1.0 = fieldNorm(field=pub_date, doc=90970)
>
> It looks like the fieldNorm's are what differ, not the IDFs.  These are
> the product of the document and/or field boost, and 1/sqrt(numTerms)
> where numTerms is the number of terms in the "pub_date" field of the
> document.  Thus if each document is only assigned one date, and you
> didn't boost the field or the document when you indexed it, this should
> be 1.0.  But if the document has two dates, then this would be
> 1/sqrt(2).  Or if you boosted this document pub_date field, then this
> will have whatever boost you provided.
>
> So, did you boost anything when indexing?  Or could a single document
> have two or more different values for pub_date?  Either would explain
this.
>
> Doug
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to