[
https://issues.apache.org/jira/browse/LUCENE-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13841411#comment-13841411
]
ASF subversion and git services commented on LUCENE-5357:
---------------------------------------------------------
Commit 1548595 from [~steve_rowe] in branch 'dev/trunk'
[ https://svn.apache.org/r1548595 ]
LUCENE-5357: Upgrade StandardTokenizer and UAX29URLEmailTokenizer to Unicode
6.3; update UAX29URLEmailTokenizer's recognized top level domains in URLs and
Emails from the IANA Root Zone Database.
> Upgrade StandardTokenizer & co to latest unicode rules
> ------------------------------------------------------
>
> Key: LUCENE-5357
> URL: https://issues.apache.org/jira/browse/LUCENE-5357
> Project: Lucene - Core
> Issue Type: New Feature
> Components: modules/analysis
> Reporter: Robert Muir
> Attachments: LUCENE-5357.patch
>
>
> besides any change in data, the rules have also changed (regional indicators,
> better handling for hebrew, etc)
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]