No, I do not mean parsing a query string.  There is a separate parser that 
parses the full XQuery language of which full-text searching is a small part.  
The parser builds an AST.  It will then be my job to walk the AST and look for 
full-text query nodes.  I then convert those into a Query object by using the 
various query classes, e.g., SpanQuery.  Hence, I would not be using the 
built-in CLucene QueryParser.
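
To make that concrete, here is a rough sketch of the kind of walk I have in
mind.  The names AstNode, QuerySpec, and collectFullText are all made up for
illustration; the real AST has its own node types, and QuerySpec is only a
stand-in for whatever I would actually build out of the CLucene query classes.

```cpp
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical AST node; the real parser's node types will differ.
struct AstNode {
    enum class Kind { Other, FullText };
    Kind kind = Kind::Other;
    std::string text;                                // search words (FullText nodes only)
    std::vector<std::unique_ptr<AstNode>> children;
};

// Hypothetical stand-in for an object built from the CLucene query classes.
struct QuerySpec {
    std::string words;
};

// Depth-first, pre-order walk that emits one QuerySpec per full-text node.
void collectFullText(const AstNode& node, std::vector<QuerySpec>& out) {
    if (node.kind == AstNode::Kind::FullText)
        out.push_back(QuerySpec{node.text});
    for (const auto& child : node.children)
        collectFullText(*child, out);
}
```

The point is only that the conversion happens node by node in my own code, so
no query string is ever produced or parsed.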

What I want to know is whether the CLucene framework is expressive enough to 
handle the criteria I listed below.  To elaborate: will I be able to 
construct/perform queries that:

+ have a word or phrase that occurs at least/most N times
+ can use or not use stop-words on a query-by-query basis
+ can use wildcards
+ can be case-sensitive or insensitive
+ can be diacritical-mark-sensitive or insensitive
+ can keep track of things like sentences and paragraphs

Yes, I am willing to write my own derived classes.

Also, I thought of another question: can I implement my own index input/output 
classes so that CLucene stores/retrieves its index data using a mechanism of my 
choice?  I.e., if I didn't want to use the binary index files that CLucene 
normally creates but instead wanted to store the data inside blobs in a SQL 
database, could I do that by writing my own classes?
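
Here is roughly the shape of the storage layer I am picturing.  Everything in
it (BlobStore, MemoryBlobStore, readRange) is a made-up name, not CLucene API,
and the in-memory map is only a stand-in for a SQL table with name and blob
columns.  What I don't know is whether CLucene's index input/output classes
can be derived so that they sit on top of something like this.

```cpp
#include <cstddef>
#include <cstdint>
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

using Blob = std::vector<std::uint8_t>;

// Hypothetical storage interface: one blob per index "file" name.
class BlobStore {
public:
    virtual ~BlobStore() = default;
    virtual void put(const std::string& name, const Blob& data) = 0;
    virtual Blob get(const std::string& name) const = 0;
    virtual bool exists(const std::string& name) const = 0;
    virtual void remove(const std::string& name) = 0;
};

// In-memory stand-in for a SQL table; a real version would issue SQL here.
class MemoryBlobStore : public BlobStore {
    std::map<std::string, Blob> rows_;
public:
    void put(const std::string& name, const Blob& data) override {
        rows_[name] = data;
    }
    Blob get(const std::string& name) const override {
        auto it = rows_.find(name);
        if (it == rows_.end())
            throw std::out_of_range("no such blob: " + name);
        return it->second;
    }
    bool exists(const std::string& name) const override {
        return rows_.count(name) != 0;
    }
    void remove(const std::string& name) override {
        rows_.erase(name);
    }
};

// Random-access read of a byte range, the sort of call an index input
// class would presumably need to make against the store.
Blob readRange(const BlobStore& store, const std::string& name,
               std::size_t offset, std::size_t length) {
    Blob all = store.get(name);
    if (offset > all.size() || length > all.size() - offset)
        throw std::out_of_range("read past end of blob");
    auto first = all.begin() + static_cast<std::ptrdiff_t>(offset);
    return Blob(first, first + static_cast<std::ptrdiff_t>(length));
}
```

Fetching the whole blob for every range read is obviously naive; I would expect
a real implementation to buffer or to read partial blobs if the database allows
it.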

- Paul


On Jan 5, 2010, at 11:43 AM, Itamar Syn-Hershko wrote:

> Hi Paul,
> 
> What do you mean by that?
> 
> CLucene should be handed a Query object to perform a search, which
> could then be filtered using a Filter object (this allows for more complex
> searches). Producing a Query object from a plain-text string is done by
> using a QueryParser; a default one comes with CLucene itself, but anyone can
> implement his own should the need arise. Same goes for Queries and Filters -
> you can create your own derived class to perform searches the way you want.
> 
> So if by "hand-coded" you mean creating your own classes, then yes, it can.
> 
> Itamar. 
> 
> -----Original Message-----
> From: Paul J. Lucas [mailto:p...@lucasmail.org] 
> Sent: Monday, January 04, 2010 10:20 PM
> To: clucene-developers@lists.sourceforge.net
> Subject: [CLucene-dev] Using CLucene to implement XQuery full-text search
> 
> Hi -
> 
> I'm looking at CLucene to implement the full-text search feature of XQuery:
> 
>       http://www.w3.org/TR/xpath-full-text-10/
> 
> Its query abilities are the most complicated I've seen.  Specifically, it
> allows one to specify the following as part of a query:
> 
> + occurs at {least|most} {N} times
> + {with|without} stop words
> + {with|without} wildcards
> + case sensitive | lowercase | uppercase diacritics insensitive 
> + {same|different} {sentence|paragraph} at {start|end} | entire content
> 
> Can CLucene do all that if the queries are hand-coded?  Thanks.
> 
> - Paul
