I am thinking of modifying lucene's current ranking algorithm to include the 
document's recency-weightage. So that the latest modified documents gets 
preference over earlier modified documents, which makes sense for news search. 

(I believe) To do this I have to tinker with TermScorer.score() method, and 
calculate document-score  in its while (doc < end) {..} loop. The requirement 
is that document's lastModifiedTime is stored in the doc's field, and 
extracting this value could be quite expensive for every iteration in its 
posting stream. One approach could be to store it in a separate file (like 
Normalization) to avoid field-lookup. 

Any other ideas/suggestions.. Or if anyone has already implemented this ? 

thanks,
Prasen

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to