Re: Untokenized lowercase string

2012-09-25 Thread am
Just wanted to confirm that this: ...works beautifully for untokenized lowercase values. Starting spaces and spaces in the middle work fine.

Re: Untokenized lowercase string

2012-09-25 Thread am
That sounds right, thanks! I missed KeywordTokenizerFactory; with a name like that, it did not sound like what I wanted. I expected NullTokenizerFactory or something that stands out like that :)

Jack Krupansky-2 wrote
> Use the KeywordTokenizerFactory for your text field tokenizer to keep the
> text

Re: Untokenized lowercase string

2012-09-25 Thread am
Alexandre Rafalovitch wrote
> Each field has a type. Each type defines what happens with the text.
> You can certainly select to do one thing but not another.

Understood. But it seemed to me that only TextField allows adding filters to it and filters go in conjunction with tokenizers. I could not

Re: Untokenized lowercase string

2012-09-25 Thread Jack Krupansky
Use the KeywordTokenizerFactory for your text field tokenizer to keep the text from being tokenized, and then use the LowerCaseFilterFactory token filter to do the lowercasing. Unfortunately, "string" (StrField) does not support analysis.
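
For anyone finding this thread later, the advice above might look roughly like the following in schema.xml. This is only a sketch: the type name "string_lowercase" and the field name "title_exact" are made up for illustration, and the attributes on the field are assumptions rather than anything posted in this thread.

```xml
<!-- Hypothetical type name; TextField is used because StrField does not run an analyzer. -->
<fieldType name="string_lowercase" class="solr.TextField">
  <analyzer>
    <!-- KeywordTokenizerFactory emits the whole input as a single token (no splitting on spaces). -->
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <!-- LowerCaseFilterFactory lowercases that single token. -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Hypothetical field using the type above. -->
<field name="title_exact" type="string_lowercase" indexed="true" stored="true"/>
```

Because no separate index and query analyzers are declared, the same chain applies at both index and query time, so a query value is lowercased as a whole before matching. The stored value is returned as originally submitted; only the indexed term is lowercased.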

Re: Untokenized lowercase string

2012-09-25 Thread Alexandre Rafalovitch
Each field has a type. Each type defines what happens with the text. You can certainly select to do one thing but not another. Just look towards the bottom of schema.xml and compare the field type definitions for string and text; the difference should be fairly obvious. You'll most probably make up a new type