Lucene 2.9 RC4 may need some changes in Solr Analyzers using CharStream & others
--------------------------------------------------------------------------------
Key: SOLR-1423
URL: https://issues.apache.org/jira/browse/SOLR-1423
Project: Solr
Issue Type: Task
Components: Analysis
Reporter: Uwe Schindler
Because of backwards compatibility problems (LUCENE-1906), we changed the
CharStream/CharFilter API slightly. Tokenizer now has only an input field of
type java.io.Reader (as it did before the CharStream code was added). To
correct offsets, Tokenizers must now call Tokenizer.correctOffset(int), which
delegates to the CharStream if input is an instance of CharStream, and
otherwise returns the offset uncorrected. Normally it is enough to change all
occurrences of input.correctOffset() to this.correctOffset() in Tokenizers.
Custom Tokenizers in Solr should also be checked to verify that they actually
correct their offsets.
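
To illustrate the delegation described above, here is a minimal, self-contained
sketch. It does not use the Lucene classes: SketchCharStream, SketchTokenizer,
PrefixSkippingStream and reportOffsets are hypothetical stand-ins invented for
this example, and only the shape of correctOffset(int) follows the description
in this issue. The real Tokenizer in 2.9 may differ in detail.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;

// Stand-in for CharStream (assumption): a Reader that can map an offset in the
// filtered text back to an offset in the original text.
abstract class SketchCharStream extends Reader {
    abstract int correctOffset(int currentOff);
}

// Hypothetical filter that drops a fixed-length prefix, so every offset in the
// filtered text is shifted by the prefix length relative to the original.
class PrefixSkippingStream extends SketchCharStream {
    private final Reader in;
    private final int skipped;

    PrefixSkippingStream(String text, int prefixLength) {
        this.in = new StringReader(text.substring(prefixLength));
        this.skipped = prefixLength;
    }

    @Override
    public int read(char[] buf, int off, int len) throws IOException {
        return in.read(buf, off, len);
    }

    @Override
    public void close() throws IOException {
        in.close();
    }

    @Override
    int correctOffset(int currentOff) {
        return currentOff + skipped;
    }
}

// Stand-in for Tokenizer (assumption): the input field is a plain Reader again.
class SketchTokenizer {
    protected Reader input;

    SketchTokenizer(Reader input) {
        this.input = input;
    }

    // Delegates to the stream if input is a SketchCharStream,
    // otherwise returns the offset uncorrected.
    protected final int correctOffset(int currentOff) {
        return (input instanceof SketchCharStream)
                ? ((SketchCharStream) input).correctOffset(currentOff)
                : currentOff;
    }

    // What a tokenizer would do when recording token offsets:
    // call this.correctOffset(...) instead of input.correctOffset(...).
    int[] reportOffsets(int start, int end) {
        return new int[] { correctOffset(start), correctOffset(end) };
    }
}
```

With a plain StringReader the offsets pass through unchanged; with the
prefix-skipping stream they are shifted back into the original text. A
Tokenizer that still records raw offsets would silently skip this correction,
which is why the custom Tokenizers need to be reviewed.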