27 apr 2006 kl. 18.41 skrev Doug Cutting:
karl wettin wrote:
Boost per position, et.c. sounds very expensive.
Indeed. It will probably nearly double the size of indexes and
also increase search time. But it is also very powerful. Consider
the posting representation Google describes on page 9 of http://
dbpubs.stanford.edu/pub/1998-8. The font-size stored there is in
effect a weight for each position.
The point is not that every index store this, but that it be
possible for some indexes to store this, or even more information
per position, by extending a public API.
Good point.
What will be required in the IndexReader? Is it enough to add getBoost
() in the TermEnum? How would the value be sent to the scorer?
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]