Re: Using a QueryParser with an untokenized field?

Eleanor Joslin Fri, 01 Feb 2008 04:47:13 -0800

Thank you, this was exactly what I needed. So "tokenizing" really denotes a more general process that can involve normalizing the case or whatever else can be done with a filter. This is where I was confused.


Eleanor


Jan Peter Stotz wrote:

Hi Eleanor.
In my Lucene index there's a field that contains the local names of XML elements, one name per document. Users can enter arbitrary queries for this field, so I'm using a QueryParser.
From reading around it looks as if the field needs to be tokenized, but since the field's content is always a single term, is this really necessary?
You are right, your field is already tokenized, but from what I know the main difference is that untokenized fields do not pass through your selected analyzer when being added to the index. If your analyzer, for example, incorporates the LowerCaseFilter, the field will be converted to lower case before it is indexed. Using the same analyzer for your QueryParser will then allow you to perform case-insensitive queries.
If you add the field untokenized and your Analyzer (at query time) incorporates the LowerCaseFilter, you will be unable to find elements that contain uppercase characters.
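
The mismatch can be illustrated with a small toy model. This is only a sketch in plain Java, not the Lucene API: the class CaseFoldingSketch, its methods, and the element name "InvoiceLine" are all made up for illustration, and analyze() merely stands in for an analyzer that contains a LowerCaseFilter.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Toy model only: this is NOT the Lucene API. All names here are hypothetical.
final class CaseFoldingSketch {
    // Stand-in for an analyzer whose filter chain includes a LowerCaseFilter.
    static String analyze(String text) {
        return text.toLowerCase(Locale.ROOT);
    }

    // Minimal inverted index: term -> ids of the documents containing it.
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // "Tokenized" field: the value passes through the analyzer before indexing.
    void addTokenized(int docId, String value) {
        postings.computeIfAbsent(analyze(value), k -> new HashSet<>()).add(docId);
    }

    // "Untokenized" field: the value is indexed verbatim, bypassing the analyzer.
    void addUntokenized(int docId, String value) {
        postings.computeIfAbsent(value, k -> new HashSet<>()).add(docId);
    }

    // Query time: the user's term is run through the analyzer before lookup.
    Set<Integer> search(String userQuery) {
        return postings.getOrDefault(analyze(userQuery), new HashSet<>());
    }

    public static void main(String[] args) {
        CaseFoldingSketch tokenized = new CaseFoldingSketch();
        tokenized.addTokenized(1, "InvoiceLine");

        CaseFoldingSketch untokenized = new CaseFoldingSketch();
        untokenized.addUntokenized(1, "InvoiceLine");

        // Indexed term is "invoiceline"; the lowercased query matches it.
        System.out.println(tokenized.search("InvoiceLine"));   // prints [1]
        // Indexed term is "InvoiceLine"; the lowercased query cannot match it.
        System.out.println(untokenized.search("InvoiceLine")); // prints []
    }
}
```

In this model the untokenized value is stored as "InvoiceLine" while every query is lowercased to "invoiceline", so even an exact-case query misses it. Analyzing the field at index time with the same analyzer removes the mismatch.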
Jan

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



--
Eleanor Joslin, Software Development   DecisionSoft Ltd.
Telephone: +44-1865-203192             http://www.decisionsoft.com
