Re: ICUFoldingFilter

2018-06-05 Thread Michael Sokolov
uir wrote: > >> > This cannot be "tweaked" at runtime, it is implemented as custom > >> normalization. > >> > > >> > You can modify the sources / build your own ruleset or use a different > >> > tokenfilter to normalize characters.

Re: ICUFoldingFilter

2018-06-04 Thread Robert Muir
> This cannot be "tweaked" at runtime, it is implemented as custom >> normalization. >> > >> > You can modify the sources / build your own ruleset or use a different >> > tokenfilter to normalize characters. >> > >> > On Mon, Jun 4, 2

Re: ICUFoldingFilter

2018-06-04 Thread Michael Sokolov
wrote: > > This cannot be "tweaked" at runtime, it is implemented as custom > normalization. > > > > You can modify the sources / build your own ruleset or use a different > > tokenfilter to normalize characters. > > > > On Mon, Jun 4, 2018 at 9:07 AM, Michael

Re: ICUFoldingFilter

2018-06-04 Thread Robert Muir
not be "tweaked" at runtime, it is implemented as custom > normalization. > > You can modify the sources / build your own ruleset or use a different > tokenfilter to normalize characters. > > On Mon, Jun 4, 2018 at 9:07 AM, Michael Sokolov wrote: >> Hi, I'm using I

Re: ICUFoldingFilter

2018-06-04 Thread Robert Muir
This cannot be "tweaked" at runtime, it is implemented as custom normalization. You can modify the sources / build your own ruleset or use a different tokenfilter to normalize characters. On Mon, Jun 4, 2018 at 9:07 AM, Michael Sokolov wrote: > Hi, I'm using ICUFoldingFilter and

ICUFoldingFilter

2018-06-04 Thread Michael Sokolov
Hi, I'm using ICUFoldingFilter and for the most part it does exactly what I want. However there are some behaviors I'd like to tweak. For example it maps "aaa^bbb" to "aaabbb". I am trying to understand why it does that, and whether there is any way to prevent it. I spent a l

RE: ICUFoldingFilter loading in IDE, but not jar ?!

2017-08-16 Thread Uwe Schindler
ormer Uwe - Uwe Schindler Achterdiek 19, D-28357 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Allison, Timothy B. [mailto:talli...@mitre.org] > Sent: Wednesday, August 16, 2017 4:41 AM > To: java-user@lucene.apache.org > Subject: RE: ICUFol

RE: ICUFoldingFilter loading in IDE, but not jar ?!

2017-08-15 Thread Allison, Timothy B.
never mind...overwriting service file... -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Tuesday, August 15, 2017 10:36 PM To: java-user@lucene.apache.org Subject: ICUFoldingFilter loading in IDE, but not jar ?! In Intellij, when I run unit tests in my

ICUFoldingFilter loading in IDE, but not jar ?!

2017-08-15 Thread Allison, Timothy B.
In Intellij, when I run unit tests in my app that uses Lucene (6.6.0) and the ICUFoldingFilterFactory, I see 96 filter factories available via TokenFilterFactory.availableTokenFilters(). When I run the same code from a jar built with the maven shade plugin, and I confirm that the jar actually

CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Scott Smith
Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CjkAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter because I need umlauts and accent characters removed from any German, French, etc

Re: CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Robert Muir
On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith ssm...@mainstreamdata.com wrote: Reading the documentation for these two filters seems to imply that CJKWidthFilter is a subset of ICUFoldingFilter. Is that true? I'm basically using the CjkAnalyzer (from Lucene 4.0) but adding ICUFoldingFilter

RE: CJKWidthFilter vs ICUFoldingFilter

2012-11-14 Thread Scott Smith
Thanks -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, November 14, 2012 12:17 PM To: java-user@lucene.apache.org Subject: Re: CJKWidthFilter vs ICUFoldingFilter On Wed, Nov 14, 2012 at 9:47 AM, Scott Smith ssm...@mainstreamdata.com wrote: Reading