emoji props in the ucdxml ?

2017-07-05 Thread Daniel Bünzli via Unicode
Hello,  I know the emoji properties [1] are no formally part of the UCD (not sure exactly why though), but are there any plans to integrate the data in the ucdxml [2] (possibly as separate files) ?  Thanks,  Daniel [1] http://www.unicode.org/reports/tr51/#Emoji_Properties_and_Data_Files [2]

Re: emoji props in the ucdxml ?

2017-07-05 Thread Ken Whistler via Unicode
On 7/5/2017 10:01 AM, Daniel Bünzli via Unicode wrote: I know the emoji properties [1] are no formally part of the UCD (not sure exactly why though), Because they are maintained as part of an independent standard now (UTS #51), which is still on track to have a faster turnaround -- and

Re: emoji props in the ucdxml ?

2017-07-05 Thread Ken Whistler via Unicode
Manuel, I suspect that such a link may already be in the works for the /Public/emoji/ data directory. But if you want to make sure your suggestion is reviewed by the UTC, you should submit it via the contact form: http://www.unicode.org/reporting.html --Ken On 7/5/2017 12:37 PM, Manuel

Algorithms for Unicode script detection

2017-07-05 Thread Simon Cozens via Unicode
I want to segment a Unicode text into runs according to their script. I've had a look through UAX#24 in the hope of finding a standard algorithm for doing this, but there isn't one specified. The implementation section gives some good pointers for what to be careful with (paired punctuation, etc.)

Re: Algorithms for Unicode script detection

2017-07-05 Thread Khaled Hosny via Unicode
On Thu, Jul 06, 2017 at 09:43:29AM +1000, Simon Cozens via Unicode wrote: > I want to segment a Unicode text into runs according to their script. > I've had a look through UAX#24 in the hope of finding a standard > algorithm for doing this, but there isn't one specified. The > implementation

Re: emoji props in the ucdxml ?

2017-07-05 Thread Manuel Strehl via Unicode
>> but are there any plans to integrate the data in the ucdxml [2] >> (possibly as separate files) ? > > No. Not unless and until they become formally part of the UCD. In this context: Would it be possible for the maintainers of the TR #51 data files to add a symlink "latest" under