[jira] [Commented] (LUCENE-8366) upgrade to icu 62.1
[ https://issues.apache.org/jira/browse/LUCENE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16882943#comment-16882943 ] Mathieu Marie commented on LUCENE-8366: --- I created LUCENE-8910 to handle that issue. I'll propose a patch. > upgrade to icu 62.1 > --- > > Key: LUCENE-8366 > URL: https://issues.apache.org/jira/browse/LUCENE-8366 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis >Reporter: Robert Muir >Priority: Major > Fix For: trunk, 7.5 > > Attachments: LUCENE-8366.patch > > > This gives unicode 11 support. > Also emoji tokenization is simpler and it gives a way to have better > tokenization for emoji from the future. -- This message was sent by Atlassian JIRA (v7.6.14#76016) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8366) upgrade to icu 62.1
[ https://issues.apache.org/jira/browse/LUCENE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16882575#comment-16882575 ] Robert Muir commented on LUCENE-8366: - good catch! do you want to submit a fix? > upgrade to icu 62.1 > --- > > Key: LUCENE-8366 > URL: https://issues.apache.org/jira/browse/LUCENE-8366 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis >Reporter: Robert Muir >Priority: Major > Fix For: trunk, 7.5 > > Attachments: LUCENE-8366.patch > > > This gives unicode 11 support. > Also emoji tokenization is simpler and it gives a way to have better > tokenization for emoji from the future. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8366) upgrade to icu 62.1
[ https://issues.apache.org/jira/browse/LUCENE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16881343#comment-16881343 ] Mathieu Marie commented on LUCENE-8366: --- sorry to comment on a closed issue. It seems to me that one file was not updated during the upgrade to 62-1. [https://github.com/apache/lucene-solr/blob/branch_7_5/lucene/analysis/icu/src/tools/java/org/apache/lucene/analysis/icu/GenerateUTR30DataFiles.java#L66 ] With that update, running again the ant target `gennorm2`should also bring 3 new files : * nfc.txt * nfkc.txt * nfkc_cf.txt > upgrade to icu 62.1 > --- > > Key: LUCENE-8366 > URL: https://issues.apache.org/jira/browse/LUCENE-8366 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis >Reporter: Robert Muir >Priority: Major > Fix For: trunk, 7.5 > > Attachments: LUCENE-8366.patch > > > This gives unicode 11 support. > Also emoji tokenization is simpler and it gives a way to have better > tokenization for emoji from the future. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-8366) upgrade to icu 62.1
[ https://issues.apache.org/jira/browse/LUCENE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16519031#comment-16519031 ] Adrien Grand commented on LUCENE-8366: -- +1 > upgrade to icu 62.1 > --- > > Key: LUCENE-8366 > URL: https://issues.apache.org/jira/browse/LUCENE-8366 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/analysis >Reporter: Robert Muir >Priority: Major > Attachments: LUCENE-8366.patch > > > This gives unicode 11 support. > Also emoji tokenization is simpler and it gives a way to have better > tokenization for emoji from the future. -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org