[
https://issues.apache.org/jira/browse/LUCENE-6016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178441#comment-14178441
]
Jeff Stein commented on LUCENE-6016:
------------------------------------
Thanks for your response Robert. I'm a new user running into the problem, not
the person who opened the Elasticsearch plugin problem.
I disagree with your assessment. I'm working on an application with support for
searching multiple languages and Polish is the only algarithimic stemmer that
has this problem.
> Stempel converts trailing 1 (and prior character) to ć
> ------------------------------------------------------
>
> Key: LUCENE-6016
> URL: https://issues.apache.org/jira/browse/LUCENE-6016
> Project: Lucene - Core
> Issue Type: Bug
> Components: modules/analysis
> Affects Versions: 4.8.1
> Reporter: Jeff Stein
>
> In the stempel analysis module, the StempelFilter TokenFilter converts a
> trailing numeric one into a ć character, while also consuming the prior
> character. This was also filed against the downstream
> [elasticsearch-analysis-stempel
> project|https://github.com/elasticsearch/elasticsearch-analysis-stempel/issues/31].
> I did not find any errors with other numbers in the trailing position.
> Example:
> || input || output ||
> | foo1 | foć |
> | foo11 | fooć |
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]