Is this the paper that you are refering to?

A. Chowdhury, D. Grossman, O. Frieder, C. McCabe,  "Document 
Normalization Revisited" , ACM-SIGIR, August 2002.
http://ir.iit.edu/~abdur/publications/p381-chowdhury.pdf

-Sean

Doron Cohen wrote on 6/30/2007, 4:56 AM:

 > In particular for TREC
 > data, I've read some (can't find the link now) comparison of the
 > performance of few systems, concluding that for that specific collection
 > the probability of a document to be relevant correlates to its length, so
 > longer docs are more probable to be relevant, and a system punishing long
 > docs too much would get poorer results.




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to