[ 
https://issues.apache.org/jira/browse/LUCENE-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13842265#comment-13842265
 ] 

Robert Muir commented on LUCENE-5357:
-------------------------------------

{quote}
About back-compat: none of the JFlex-based tokenizers on trunk have 
version-based behavior at this point, in contrast to branch_4x
{quote}

I would love if all these constants/parameters were completely removed in 
trunk. if you look at the mailing lists, its obvious that users dont even 
understand it at all. I dont know how index back compat got perverted into such 
a thing that made all the analysis apis ugly and overcomplicated.

This stuff all hurts the project far more than any benefit it brings to the 
rare few that understand it. I think it should be removed everywhere.

> Upgrade StandardTokenizer & co to latest unicode rules
> ------------------------------------------------------
>
>                 Key: LUCENE-5357
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5357
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Robert Muir
>            Assignee: Steve Rowe
>             Fix For: 5.0, 4.7
>
>         Attachments: LUCENE-5357.patch
>
>
> besides any change in data, the rules have also changed (regional indicators, 
> better handling for hebrew, etc)



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to