[ https://issues.apache.org/jira/browse/LUCENE-9914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17317121#comment-17317121 ]
Robert Muir commented on LUCENE-9914:
-------------------------------------
FYI: For the generated JFlex macro, we want the Unicode version to match what
the rest of the JFlex grammar is using, since new Unicode versions sometimes
have features that require a newer JFlex version.
So we may want to add something like the following to the script, to make it
clear which Unicode and ICU versions the output was generated with:
{code}
import com.ibm.icu.lang.UCharacter;
import com.ibm.icu.util.VersionInfo;
System.out.println("// Unicode Version: " + UCharacter.getUnicodeVersion());
System.out.println("// ICU Version: " + VersionInfo.ICU_VERSION);
{code}
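To make the idea concrete, here is a minimal sketch of how a Java port of the script might stamp that version comment onto the generated macro. It is only an illustration under stated assumptions: the class EmojiMacroSketch and the methods parseRanges and toJflexMacro are hypothetical names, the input is assumed to follow the "range ; property # comment" layout of emoji-data.txt, and the macro is assumed to use JFlex's \u{...} code point escapes. The version string is passed in as a plain parameter so the sketch has no ICU dependency; in a real script it would presumably come from UCharacter.getUnicodeVersion() as above.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: class name, method names, and output layout are assumptions,
// not the actual Lucene regeneration script.
class EmojiMacroSketch {

  // Collects inclusive {start, end} code point ranges tagged with the given property.
  static List<int[]> parseRanges(List<String> lines, String property) {
    List<int[]> ranges = new ArrayList<>();
    for (String line : lines) {
      // Everything after the first '#' is a comment in the data file.
      int hash = line.indexOf('#');
      String data = (hash >= 0 ? line.substring(0, hash) : line).trim();
      if (data.isEmpty()) continue;
      String[] fields = data.split(";");
      if (fields.length < 2 || !fields[1].trim().equals(property)) continue;
      // The first field is either a single code point or "START..END", both in hex.
      String[] bounds = fields[0].trim().split("\\.\\.");
      int start = Integer.parseInt(bounds[0], 16);
      int end = bounds.length > 1 ? Integer.parseInt(bounds[1], 16) : start;
      ranges.add(new int[] {start, end});
    }
    return ranges;
  }

  // Emits a version header comment followed by a single character-class macro.
  static String toJflexMacro(String name, List<int[]> ranges, String unicodeVersion) {
    StringBuilder sb = new StringBuilder();
    sb.append("// Unicode Version: ").append(unicodeVersion).append('\n');
    sb.append(name).append(" = [");
    for (int[] r : ranges) {
      sb.append(String.format("\\u{%X}", r[0]));
      if (r[1] != r[0]) sb.append('-').append(String.format("\\u{%X}", r[1]));
    }
    return sb.append("]\n").toString();
  }

  public static void main(String[] args) {
    // Hand-written sample lines; the version string is a placeholder.
    List<String> sample = List.of(
        "# sample data",
        "0023 ; Emoji # number sign",
        "1F600..1F64F ; Emoji # faces",
        "1F3FB..1F3FF ; Emoji_Modifier # skin tones");
    System.out.print(toJflexMacro("Emoji", parseRanges(sample, "Emoji"), "13.0"));
  }
}
```

Keeping the version as an argument means the header always reflects whatever data the macro was built from, which is the point of the suggestion above.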
> Modernize Emoji regeneration scripts
> ------------------------------------
>
> Key: LUCENE-9914
> URL: https://issues.apache.org/jira/browse/LUCENE-9914
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Dawid Weiss
> Assignee: Dawid Weiss
> Priority: Minor
>
> These are Perl scripts... I don't think they had ant tasks in 8.x and they
> haven't been used in a while. They don't seem too scary (for Perl): they just
> fetch emoji Unicode descriptions and parse them into a JFlex macro and a test
> case.
> It'd be good to convert them to Python, Groovy or even Java so that they
> fit better in the build system. Alternatively, perhaps there is a way to get
> these codepoint properties from Java directly?