[
https://issues.apache.org/jira/browse/SOLR-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751374#action_12751374
]
Shalin Shekhar Mangar commented on SOLR-1410:
---------------------------------------------
bq. I don't think we've ever really had a situation like this ...logging a
warning seems like the right course of action for now ...
We actually have done this in DataImportHandler in relation to the syntax for
evaluators. Logging a warning is the right way to go.
> remove deprecated custom encoding support in russian/greek analysis
> -------------------------------------------------------------------
>
> Key: SOLR-1410
> URL: https://issues.apache.org/jira/browse/SOLR-1410
> Project: Solr
> Issue Type: Task
> Components: Analysis
> Reporter: Robert Muir
> Priority: Minor
> Attachments: SOLR-1410.patch
>
>
> In this case, analyzers have strange encoding support and it has been
> deprecated in lucene.
> For example someone using CP1251 in the russian analyzer is simply storing Ж
> as 0xC6, its being represented as Æ
> LUCENE-1793: Deprecate the custom encoding support in the Greek and Russian
> Analyzers. If you need to index text in these encodings, please use Java's
> character set conversion facilities (InputStreamReader, etc) during I/O,
> so that Lucene can analyze this text as Unicode instead.
> I noticed in solr, the factories for these tokenstreams allow these
> configuration options, which are deprecated in 2.9 to be removed in 3.0
> Let me know the policy (how do you deprecate a config option in solr exactly,
> log a warning, etc?) and I'd be happy to create a patch.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.