Yes, of course it makes sense. I was just confused about the documentation for the Similarity function.
On Wed, Dec 3, 2008 at 9:52 AM, Erick Erickson <[EMAIL PROTECTED]>wrote: > I'm not much of an expert on term frequencies and scoring, > but would you really want the score calculated for a document > to be affected by the occurrence of terms in a field you did > NOT search on? > > I sure wouldn't, > > > Best > Erick > > On Wed, Dec 3, 2008 at 10:44 AM, Gustavo Corral <[EMAIL PROTECTED] > >wrote: > > > Hi list, > > > > I hope this is not a silly question, but I should ask. > > > > I developed a IR system for XML documents with Lucene and I was checking > > the > > explain() output for some queries, but I don't understand this part: > > > > 0.121383816 = fieldWeight(title:efecto in 1), product of: > > 1.0 = tf(termFreq(title:efecto)=1) > > 0.7768564 = idf(docFreq=4) > > > > It suppose tf refears to the term's frequency in the document, but I know > > there are more than one occurrences of this term in this document, so I > > noted that termFreq stores the frequency of a term in the document, but > > just > > for the given field, like if Lucene takes a field like a whole document. > Is > > this correct, and if that is the case is this a good practice for IR? > > > > Thanks for any help or a good explanation. > > >