Re: NamesList.txt as data source

Janusz S. Bień Sat, 26 Mar 2016 02:15:08 -0700

On Thu, Mar 10 2016 at 22:40 CET, [email protected] writes:

[...]


> The *reason* that NamesList.txt exists at all is to drive the tool, unibook,
> that formats the full Unicode code charts for posting. It is only
> posted in the Unicode Character Database at all as a matter of
> convenience, to give people access to a text only version of the
> names list that appears in the fully formatted pdf versions of the
> code charts
> that contain all the representative glyphs.
>
> NamesList.txt should *not* be data mined.

I've just noticed that NamesList.txt is in a sense data mined by the
Unicode consortium itself. I mean the "Unicode Utilities: Character
Properties", which e.g. for LATIN SMALL LETTER P WITH FLOURISH
(http://unicode.org/cldr/utility/character.jsp?a=A753) display in
particular

subhead: Medievalist addition

Am I right that this information is available only in NamesList.txt?

In my opinion this is important information and should be officially
available for character data mining engines.

Best regards

Janusz

-- 
                           ,   
Prof. dr hab. Janusz S. Bien -  Uniwersytet Warszawski (Katedra Lingwistyki 
Formalnej)
Prof. Janusz S. Bien - University of Warsaw (Formal Linguistics Department)
[email protected], [email protected], http://fleksem.klf.uw.edu.pl/~jsbien/

Re: NamesList.txt as data source

Reply via email to