Hello,

I have a set of documents in which I have different classes of fields that I
would like to search separately. For example, I would like to search the
HTML body and title of a webpage using one set of keywords, and the page
author using another set. I cannot use the dismax parser for this problem
because all keywords will search across all the query fields. However, I
like the dismax query parser because it handles matching and scoring very
nicely.

I read one suggestion on this group which was to make one of the queries a
query filter. So for example, I may use the dismax query parser to search
the body and title of a webpage, then use a filter query for the author.
There are two problems with this approach in regards to what I need:
  1. The filter query does not affect scoring, but I need the scoring to be
influenced by the results of all fields being searched.
  2. A filter query will do a simple AND or OR filter, while I would need
the search to be an OR search with higher scoring for multiple matches
(related to the first problem).

I think what I need is a dismax parser into which the parsed query will not
just contain all keywords for all fields, but into which you can specify
which fields correspond to which sets of keywords. Has anything like this
been tackled before? If not, can someone help point me in the right
direction for how I would build this myself? Thanks very much for your time.

Regards,

Mark Ferguson

Reply via email to