Carrie Coy created SOLR-4122:
--------------------------------
Summary: EnglishMinimalStemmer incorrectly tokenizes words ending
in "hes" and "xes"
Key: SOLR-4122
URL: https://issues.apache.org/jira/browse/SOLR-4122
Project: Solr
Issue Type: Bug
Components: Schema and Analysis
Affects Versions: 4.0
Reporter: Carrie Coy
Stemmer tokenizes "dishes" to "dishe" and boxes to "boxe". Seems like this
addition would fix it.
case 'e':
if (len > 3 && s[len-3] == 'i' && s[len-4] != 'a' && s[len-4] != 'e') {
s[len - 3] = 'y';
return len - 2;
}
*if (s[len-3] == 'x' || s[len-3] == 'h' )
return len-2;*
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]