Re: Emoji and Annotation data

Takao Fujiwara Sun, 26 Jun 2016 21:14:44 -0700

On 06/25/16 01:04, Mark Davis ☕️-san wrote:

You should never be scraping /any/ Unicode HTML files. They are not made for 
that, and there is no guarantee of stability.


I cannot find the license or descriptions about the HTML files.


The emoji files are built from data which is described in 
http://www.unicode.org/reports/tr51/
(plus CLDR annotations and collation)


OK, I need the data which packages Emoji unicode and the annotation.
It would be great if the data could be provided besides the html files.

Thanks,
Fujiwara


Mark
//////

On Fri, Jun 24, 2016 at 7:21 AM, Takao Fujiwara <[email protected] 
<mailto:[email protected]>> wrote:

    Hi,

    I'm working on IBus - the input method framework for Linux.
    I parse http://unicode.org/emoji/charts/emoji-list.html and create a 
dictionary between the annotations and the Emoji characters.
    Since the file size is large and it's often updated, I'm thinking how to 
maintain the file.

    I copied the file as http://ibus.github.io/files/ibus/emoji-list.html for 
the build at the moment.

    I have questions:
     - if unicode.org <http://unicode.org> provides the tarball of the stable 
html files or other data.
     - what is the license of the html files.

    Do you have any ideas?

    Thanks,
    Fujiwara

Re: Emoji and Annotation data

Reply via email to