[
https://issues.apache.org/jira/browse/LUCENE-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14258787#comment-14258787
]
Uwe Schindler edited comment on LUCENE-6137 at 12/25/14 5:43 PM:
-----------------------------------------------------------------
Hi,
I also agree that you should raise this issue with the CLEF people who invented
that stemmer!
I talked with my wife (a native Russian speaker), and she can confirm your
problem with some *neuter* adjective forms. But, as expected, she also confirms
that removing "-ee" is too risky, because it would also change the comparative
form (which ends in -ee as well), and that is not something a "light stemmer"
is meant to do. I think this might be the reason "-ee" is not removed by
default (removing it changes the meaning).
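
To make the trade-off concrete, here is a minimal, self-contained Java sketch. It is a toy suffix stripper, not the algorithm in RussianLightStemmer: the class name `EeSuffixSketch`, the `stripEe` flag, and the short ending list are all assumptions made up for illustration. It only shows how an extra "-ее" rule would fix the neuter form синее while also folding the comparative сильнее into the stem of сильный.

```java
class EeSuffixSketch {
    // Hypothetical, heavily simplified ending list (longer endings first).
    // This is NOT the rule set used by Lucene's RussianLightStemmer.
    private static final String[] ENDINGS = {
        "ий", "яя", "ие", "ый", "ая", "ое", "ые", "е"
    };

    // stripEe toggles the extra "-ее" rule discussed in this issue.
    static String stem(String word, boolean stripEe) {
        if (stripEe && word.endsWith("ее") && word.length() > 3) {
            return word.substring(0, word.length() - 2);
        }
        for (String ending : ENDINGS) {
            // Keep at least two characters of stem.
            if (word.endsWith(ending) && word.length() > ending.length() + 1) {
                return word.substring(0, word.length() - ending.length());
            }
        }
        return word;
    }

    public static void main(String[] args) {
        String[] words = {"синий", "синее", "сильный", "сильнее"};
        for (String w : words) {
            System.out.println(w + " -> without rule: " + stem(w, false)
                    + ", with rule: " + stem(w, true));
        }
    }
}
```

In this sketch, the extra rule brings синее in line with синий, but it also makes the comparative сильнее ("stronger") produce the same stem as сильный ("strong"), which is the kind of conflation a light stemmer is supposed to avoid.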
was (Author: thetaphi):
Hi,
I also agree that you should raise this issue with the CLEF people who invented
that stemmer!
I talked with my wife (a native Russian speaker), and she can confirm your
problem with some *neuter* adjective forms. But, as expected, she also confirms
that removing "-ee" is too risky, because it would also change superlatives
(which use -ee as well), and that is not something a "light stemmer" is meant
to do. I think this might be the reason "-ee" is not removed by default
(removing it changes the meaning).
> RussianLightStemmer incorrectly handles the words ending with 'ее'
> ------------------------------------------------------------------
>
> Key: LUCENE-6137
> URL: https://issues.apache.org/jira/browse/LUCENE-6137
> Project: Lucene - Core
> Issue Type: Bug
> Affects Versions: 4.10.2
> Reporter: Alexander Sofronov
> Attachments: LUCENE-6137.patch
>
>
> Consider the forms of the Russian word "синий" and the results returned by
> RussianLightStemmer:
> синий -> син
> синяя -> син
> синее -> сине
> синие -> син
> I think the correct result should be:
> синий -> син
> синяя -> син
> синее -> син
> синие -> син