[ https://issues.apache.org/jira/browse/LUCENE-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13842265#comment-13842265 ]
Robert Muir commented on LUCENE-5357:
-------------------------------------
{quote}
About back-compat: none of the JFlex-based tokenizers on trunk have
version-based behavior at this point, in contrast to branch_4x
{quote}
I would love it if all these constants/parameters were completely removed in
trunk. If you look at the mailing lists, it's obvious that users don't
understand them at all. I don't know how index back compat got perverted into
something that made all the analysis APIs ugly and overcomplicated.
This stuff hurts the project far more than any benefit it brings to the
rare few who understand it. I think it should be removed everywhere.
> Upgrade StandardTokenizer & co to latest unicode rules
> ------------------------------------------------------
>
> Key: LUCENE-5357
> URL: https://issues.apache.org/jira/browse/LUCENE-5357
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/analysis
> Reporter: Robert Muir
> Assignee: Steve Rowe
> Fix For: 5.0, 4.7
>
> Attachments: LUCENE-5357.patch
>
>
> Besides any change in data, the rules have also changed (regional indicators,
> better handling for Hebrew, etc.)
--
This message was sent by Atlassian JIRA
(v6.1#6144)