Re: Lowercase all characters in String

Zheng Lin Edwin Yeo Tue, 11 Oct 2016 16:41:40 -0700

Thanks Ahmet and Walter.

It works.


Regards,
Edwin


On 11 October 2016 at 23:36, Walter Underwood <wun...@wunderwood.org> wrote:

> Like this:
>
>     <!-- Like a string class, but lower cased -->
>     <fieldType name="string_lower" class="solr.TextField">
>       <analyzer>
>         <tokenizer class="solr.KeywordTokenizerFactory"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>       </analyzer>
>     </fieldType>
>
> wunder
> Walter Underwood
> wun...@wunderwood.org
> http://observer.wunderwood.org/  (my blog)
>
>
> > On Oct 11, 2016, at 7:43 AM, Ahmet Arslan <iori...@yahoo.com.INVALID> wrote:
> >
> > Hi,
> >
> > KeywordTokenizer and LowerCaseFilter should suffice. Optionally you can
> > add TrimFilter too.
> >
> > Ahmet
> >
> >
> > On Tuesday, October 11, 2016 5:24 PM, Zheng Lin Edwin Yeo <edwinye...@gmail.com> wrote:
> > Hi,
> >
> > I would like to find out the best way to lowercase all the text
> > while preserving all the tokens.
> >
> > As I need to preserve every character of the text (including symbols and
> > white space), I'm using String. However, I can't put the
> > LowerCaseFilterFactory in String.
> >
> > I found that we can use WhitespaceTokenizerFactory, followed by
> > LowerCaseFilterFactory. Although WhitespaceTokenizerFactory can preserve
> > the symbols, it will still split on Whitespace, which is what we do not
> > want. This is because we may have words like 'One' and 'One Way'. If we
> > use the WhitespaceTokenizerFactory and search for 'One', it will return
> > records with 'One Way' too, which is what we do not want.
> >
> > Is there any other way we can achieve this?
> >
> > I'm using Solr 6.2.1.
> >
> > Regards,
> > Edwin
>
>
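
For reference, the two suggestions quoted above can be combined into a single
field type. This is only a sketch: the names "string_lower_trim" and
"title_exact" are made up for illustration, and the filter order (trim first,
then lowercase) is one reasonable choice, not something stated in the thread.

    <!-- Whole value kept as one token, trimmed and lower cased -->
    <fieldType name="string_lower_trim" class="solr.TextField">
      <analyzer>
        <!-- Emits the entire input as a single token, so 'One Way' is not split -->
        <tokenizer class="solr.KeywordTokenizerFactory"/>
        <!-- Optional: strips leading and trailing whitespace from the token -->
        <filter class="solr.TrimFilterFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

    <!-- Hypothetical field using the type above -->
    <field name="title_exact" type="string_lower_trim" indexed="true" stored="true"/>

Because the analyzer is declared without a type attribute, the same chain runs
at index and query time. Only the indexed term is lowercased; the stored value
returned in results keeps its original case. A multi-word query value may still
need to be quoted or escaped so that the query parser does not split it on
whitespace before analysis; that is worth checking against the parser in use.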
