Hello,
I know the emoji properties [1] are no formally part of the UCD (not sure
exactly why though), but are there any plans to integrate the data in the
ucdxml [2] (possibly as separate files) ?
Thanks,
Daniel
[1] http://www.unicode.org/reports/tr51/#Emoji_Properties_and_Data_Files
[2]
On 7/5/2017 10:01 AM, Daniel Bünzli via Unicode wrote:
I know the emoji properties [1] are no formally part of the UCD (not sure
exactly why though),
Because they are maintained as part of an independent standard now (UTS
#51), which is still on track to have a faster turnaround -- and
Manuel,
I suspect that such a link may already be in the works for the
/Public/emoji/ data directory. But if you want to make sure your
suggestion is reviewed by the UTC, you should submit it via the contact
form:
http://www.unicode.org/reporting.html
--Ken
On 7/5/2017 12:37 PM, Manuel
I want to segment a Unicode text into runs according to their script.
I've had a look through UAX#24 in the hope of finding a standard
algorithm for doing this, but there isn't one specified. The
implementation section gives some good pointers for what to be careful
with (paired punctuation, etc.)
On Thu, Jul 06, 2017 at 09:43:29AM +1000, Simon Cozens via Unicode wrote:
> I want to segment a Unicode text into runs according to their script.
> I've had a look through UAX#24 in the hope of finding a standard
> algorithm for doing this, but there isn't one specified. The
> implementation
>> but are there any plans to integrate the data in the ucdxml [2]
>> (possibly as separate files) ?
>
> No. Not unless and until they become formally part of the UCD.
In this context: Would it be possible for the maintainers of the TR #51
data files to add a symlink "latest" under
6 matches
Mail list logo