[ https://issues.apache.org/jira/browse/LUCENE-6138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Massimo Pasquini updated LUCENE-6138: ------------------------------------- Summary: ItalianLightStemmer doesn't apply on words shorter then 6 chars in length (was: ItalianLightStemmer) > ItalianLightStemmer doesn't apply on words shorter then 6 chars in length > ------------------------------------------------------------------------- > > Key: LUCENE-6138 > URL: https://issues.apache.org/jira/browse/LUCENE-6138 > Project: Lucene - Core > Issue Type: Bug > Components: modules/analysis > Affects Versions: 4.10.2 > Reporter: Massimo Pasquini > Priority: Minor > > I expect a stemmer to transform nouns in their singular and plural forms into > a shorter common form. The implementation of the ItalianLightStemmer doesn't > apply any stemming to words shorter then 6 characters in length. This leads > to some annoying results: > singular form | plural form > 4|5 chars in length (no stemming) > alga -> alga | alghe -> alghe > fuga -> fuga | fughe -> fughe > lega -> lega | leghe -> leghe > 5|6 chars in length (stemming only on plural form) > vanga -> vanga | vanghe -> vang > verga -> verga | verghe -> verg > I suppose that such limitation on words length is to avoid other side effects > on shorter words not in the set above, but I think something must be reviewed > in the code for better results. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org