RE: basic questions on the new QueryParser

Uwe Schindler Mon, 03 Aug 2009 13:12:41 -0700

Thanks for this howto; maybe I can try to do this for LUCENE-1768 as some
exercise at the end of the week :-) - I was not able to look into the new
framework in detail (I only looked into the Powerpoints on Michael B's
Linkedin profile page), but it would be good to have a howto like this mail
somewhere in the wiki or docs.


 

If I read it correctly, the parsing of the query string is decoupled from
this. So somebody could also create another query syntax and reuse all
processors and builders?

 

In my opinion, Solr could also use this query parser framework and implement
all these special field-specific things like that.

 

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de

  _____  

From: Adriano Crestani [mailto:adrianocrest...@gmail.com] 
Sent: Monday, August 03, 2009 10:00 PM
To: java-dev@lucene.apache.org
Subject: Re: basic questions on the new QueryParser

 

I see the "original" package, and the useful OriginalQueryParserHelper
within there, that are meant to ease the transition off of the old
QueryParser to the new one.

First question: is the original package also intended to be Lucene's
"default" QueryParser, going forward?  Ie, most users will simply want
to get Lucene's default QueryParser and parse text into Query objs,
and then expert users will directly use core.* to make their own query
parsers.

If so, maybe we should rename "OriginalQueryParserHelper" to
"DefaultQueryParser" and perhaps move it up into oal.queryParser?

Yes, it's the Lucene's "default" query parser implemented using the new QP
framework.
We already discussed what name should be given to it (original, standard,
default, main, etc) on LUCENE-1567, that 
is why we picked the name "original", but there is no specific reason why we
picked it over the others.

Now, about moving it to oal.queryParser, I think it's a subjective decision.
I like it inside oal.queryParser.original, because
we can easily distinguish what is the "original" implementation. But like I
said, it's really subjective.

Second question: say I wanted to start with the DefaultQueryParser,
but tweak how it produces range queries.  EG the old QueryParser
attempted to parse the lower/upper bounds as Date, and then re-encodes
them using DateTools (LUCENE-1768 is already open to do this plus
NumericQuery "for real"... I'm just using it as a use case of how an
outside consumer of this API could tweak the default QueryParser).

If I want to do this using the new QueryParser, could I:

 * First get the DefaultQueryParser

 * Call getQueryBuilder to get its QueryBuilder

 * Call QueryBuilder.setBuilder(

RangeQueryNode.class, new
   MyRangeQueryNodeBuilder()), where MyRangeQueryNodeBuilder is my
   new class that handles the dates.

?

Ie, is that (swapping in your own QueryBuilder) the right way to tweak
how the DefaultQueryParser would create queries?

If you want to be able to create NumericQuery objects based on the field
data type (float, int, date, etc), which the user specified in some kind of
configuration, changing the builders would not be enough, for these 2
reasons:

1- Builders are meant to be used only to build the final objects from query
nodes, processing, like converting the range values to the correct data
format should be ideally be performed in a processor.

2- Builders have no access to configuration at build time, the only query
parser phase able to do that is the query processing phase.

So, you will need to:

- create an Attribute to hold the data type for each range field, so you can
set the data type on this attribute and add it to the QueryConfigHandler
(you may call originalQP.getQueryConfigHandler() to get it)

- create a processor that reads this configuration from the
QueryConfigHandler and for each RangeQueryNode object and convert their
values to the correct format

- add the processor to the current OriginalQueryParserPipeline (calling
defaultQP.getQueryNodeProcessor().addProcessor(new
MyNewRangeQueryNodeProcessor())

- create a builder that creates NumericQuery objects from RangeQueryNode and
add it to the builders' map:
defaultQP.getQueryBuilder().setBuilder(RangeQueryNode.class, new
MyRangeQueryNodeBuilder())

I think this is what should be done on LUCENE-1768. But, if you decide you
don't need to convert the range values based on the field data type
specified by the user and you want to hardcode that on the builder, you can
do the way you described. However, ideally, if you do any query processing,
like values conversion, you should create a processor for that, because they
were desinged for that.

Best Regards,
Adriano Crestani Campos

 

On Mon, Aug 3, 2009 at 11:12 AM, Michael McCandless
<luc...@mikemccandless.com> wrote:

Second question: say I wanted to start with the DefaultQueryParser,
but tweak how it produces range queries.  EG the old QueryParser
attempted to parse the lower/upper bounds as Date, and then re-encodes
them using DateTools (LUCENE-1768 is already open to do this plus
NumericQuery "for real"... I'm just using it as a use case of how an
outside consumer of this API could tweak the default QueryParser).

If I want to do this using the new QueryParser, could I:

 * First get the DefaultQueryParser

 * Call getQueryBuilder to get its QueryBuilder

 * Call QueryBuilder.setBuilder(RangeQueryNode.class, new
   MyRangeQueryNodeBuilder()), where MyRangeQueryNodeBuilder is my
   new class that handles the dates.

?

Ie, is that (swapping in your own QueryBuilder) the right way to tweak
how the DefaultQueryParser would create queries?

RE: basic questions on the new QueryParser

Reply via email to