Is this the paper that you are refering to? A. Chowdhury, D. Grossman, O. Frieder, C. McCabe, "Document Normalization Revisited" , ACM-SIGIR, August 2002. http://ir.iit.edu/~abdur/publications/p381-chowdhury.pdf
-Sean Doron Cohen wrote on 6/30/2007, 4:56 AM: > In particular for TREC > data, I've read some (can't find the link now) comparison of the > performance of few systems, concluding that for that specific collection > the probability of a document to be relevant correlates to its length, so > longer docs are more probable to be relevant, and a system punishing long > docs too much would get poorer results. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]