[ https://issues.apache.org/jira/browse/LUCENE-841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Paul Cowan updated LUCENE-841: ------------------------------ Attachment: lucene-841.patch Patch which replaces all non-ASCII characters in the 4 mentioned stemmer files with their \uxxxx equivalents. For anyone else who ever needs to do this, it's a 10-second job in the free Windows editor BabelPad (http://www.babelstone.co.uk/Software/BabelPad.html) > Replace UTF8 characters in stemmer code with integer values. > ------------------------------------------------------------ > > Key: LUCENE-841 > URL: https://issues.apache.org/jira/browse/LUCENE-841 > Project: Lucene - Java > Issue Type: Improvement > Components: Analysis > Reporter: Karl Wettin > Priority: Critical > Attachments: lucene-841.patch > > > BrazillianStemmer, GermanStemmer, FrenchStemmer and DutchStemmer all contains > UTF characters in the java code. All environments does not handle that. It > really ought to be integer values instead. > I'll come up with a patch sooner or later. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]