[ https://issues.apache.org/jira/browse/LUCENE-4063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277635#comment-13277635 ]
Tanguy Moal commented on LUCENE-4063: ------------------------------------- I'd be glad to see this on 3.x x >=4 since that's the version I used to spot the issue, may be should I have marked this issue as a bug rather than improvement ? :-) I have a custom filterfactory marking numbers as keywords anyway as I needed a quick fix. So from my point of view it doesn't really matter... I could just drop that filter from my analysis if the patch finds its way to 3x. Thank you very much for your quick responses about this issue. > FrenchLightStemmer performs abusive compression of (arbitrary) repeated > characters in long tokens > ------------------------------------------------------------------------------------------------- > > Key: LUCENE-4063 > URL: https://issues.apache.org/jira/browse/LUCENE-4063 > Project: Lucene - Java > Issue Type: Improvement > Components: modules/analysis > Affects Versions: 3.4, 4.0 > Reporter: Tanguy Moal > Assignee: Steven Rowe > Priority: Minor > Attachments: LUCENE-4063.patch, SOLR-3463.patch, SOLR-3463.patch, > SOLR-3463.patch > > > FrenchLightStemmer performs aggressive deletions on repeated character > sequences, even on numbers. > This might be unexpected during full text search. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org