On 12/1/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 12/1/06, Robin Bonin <[EMAIL PROTECTED]> wrote:
> The term Nike in a product title should have a lot higher relevance, than
> the term Nike somewhere in a product review.
> I would like to keep indexing the reviews, but I need to specify the title
> at a much higher rate. Is that possible?
> I can split the values into their own fields if that helps.

That would absolutely help.... then you could use the lucene
QueryParser syntax for boosting:

A query like the following will count matches in the title roughly 10
times as important:

title:Nike^10 review_body:Nike

The dismax handler also has a way of dealing with this (see the Wiki),
and that would be something to explore after you are comfortable with
standard Lucene syntax.

Note that you might want to try spliiting the text into fields and
doing no further boosting--usually the idf and length normalization
factor will give very high scores to title fields without explicit
boosting.

-Mike

Reply via email to