Re: Whither Query Norm?

Mark Miller Fri, 20 Nov 2009 17:03:16 -0800

Go back and put it in after you have all the documents for that commitpoint. Or on reader load, calculate it.


- Mark


http://www.lucidimagination.com (mobile)

On Nov 20, 2009, at 7:56 PM, Jake Mannix <[email protected]> wrote:

On Fri, Nov 20, 2009 at 4:51 PM, Mark Miller <[email protected]> wrote:

Okay - my fault - I'm not really talking in terms of Lucene. Though even there I consider it possible. You'd just have to like, rewrite it :) And it would likely be pretty slow.

Rewrite it how? When you index the very first document, the docFreq of all terms is 1, out of numDocs = 1 docs in the corpus. Everybody's idf is the same. No matter how you normalize this, it'll be wrong once you've indexed a million documents. This isn't a matter of Lucene architecture, it's a matter of idf being a value that is exactly available only at query time (you can approximate it partway through indexing, but you don't know it at all when you start).

  -jake
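
To make the docFreq/numDocs point above concrete, here is a minimal Java sketch. It assumes the classic Lucene-style formula idf = log(numDocs / (docFreq + 1)) + 1; the class name IdfDrift, the one-million-document corpus, and the docFreq values of 10 and 500,000 are hypothetical and chosen only for illustration. It is not code from Lucene itself.

```java
public class IdfDrift {
    // Assumed formula (classic Lucene-style idf), not taken from this thread:
    // idf = log(numDocs / (docFreq + 1)) + 1
    public static double idf(int docFreq, int numDocs) {
        return Math.log(numDocs / (double) (docFreq + 1)) + 1.0;
    }

    public static void main(String[] args) {
        // After the very first document, every term has docFreq = 1 and
        // numDocs = 1, so a rare term and a common term look identical.
        double rareEarly = idf(1, 1);
        double commonEarly = idf(1, 1);

        // Hypothetical final corpus: one million documents.
        int numDocs = 1_000_000;
        double rareLate = idf(10, numDocs);        // term appears in 10 docs
        double commonLate = idf(500_000, numDocs); // term appears in half the docs

        System.out.printf("first doc:   rare=%.3f common=%.3f%n", rareEarly, commonEarly);
        System.out.printf("full corpus: rare=%.3f common=%.3f%n", rareLate, commonLate);
    }
}
```

Any idf baked into the index at the time the first document is written would be the same for both terms, while the values computed against the full corpus differ widely. That is why the value would have to be filled in after the fact (per commit point or on reader load) or computed at query time.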
