[jira] [Commented] (LUCENE-6016) Stempel converts trailing 1 (and prior character) to ć

Jeff Stein (JIRA) Tue, 21 Oct 2014 07:07:29 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-6016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178441#comment-14178441
 ]


Jeff Stein commented on LUCENE-6016:
------------------------------------

Thanks for your response Robert. I'm a new user running into the problem, not 
the person who opened the Elasticsearch plugin problem.

I disagree with your assessment. I'm working on an application with support for 
searching multiple languages and Polish is the only algarithimic stemmer that 
has this problem.

> Stempel converts trailing 1 (and prior character) to ć
> ------------------------------------------------------
>
>                 Key: LUCENE-6016
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6016
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/analysis
>    Affects Versions: 4.8.1
>            Reporter: Jeff Stein
>
> In the stempel analysis module, the StempelFilter TokenFilter converts a 
> trailing numeric one into a ć character, while also consuming the prior 
> character. This was also filed against the downstream 
> [elasticsearch-analysis-stempel 
> project|https://github.com/elasticsearch/elasticsearch-analysis-stempel/issues/31].
> I did not find any errors with other numbers in the trailing position.
> Example:
> || input || output ||
> | foo1 | foć |
> | foo11 | fooć |



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (LUCENE-6016) Stempel converts trailing 1 (and prior character) to ć

Reply via email to