fix reverseStringFilter for unicode 4.0
---------------------------------------
Key: LUCENE-2068
URL: https://issues.apache.org/jira/browse/LUCENE-2068
Project: Lucene - Java
Issue Type: Improvement
Components: contrib/analyzers
Reporter: Robert Muir
Priority: Minor
Fix For: 3.1
ReverseStringFilter is not aware of supplementary characters: when it reverses
it will create unpaired surrogates, which will be replaced by U+FFFD by the
indexer (but not at query time).
The wrong words will conflate to each other, and the right words won't match,
basically the whole thing falls apart.
This patch implements in-place reverse with the algorithm from apache harmony
AbstractStringBuilder.reverse0()
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]