On Thu, Mar 10 2016 at 22:40 CET, [email protected] writes: [...]
> The *reason* that NamesList.txt exists at all is to drive the tool, unibook, > that formats the full Unicode code charts for posting. It is only > posted in the Unicode Character Database at all as a matter of > convenience, to give people access to a text only version of the > names list that appears in the fully formatted pdf versions of the > code charts > that contain all the representative glyphs. > > NamesList.txt should *not* be data mined. I've just noticed that NamesList.txt is in a sense data mined by the Unicode consortium itself. I mean the "Unicode Utilities: Character Properties", which e.g. for LATIN SMALL LETTER P WITH FLOURISH (http://unicode.org/cldr/utility/character.jsp?a=A753) display in particular subhead: Medievalist addition Am I right that this information is available only in NamesList.txt? In my opinion this is important information and should be officially available for character data mining engines. Best regards Janusz -- , Prof. dr hab. Janusz S. Bien - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej) Prof. Janusz S. Bien - University of Warsaw (Formal Linguistics Department) [email protected], [email protected], http://fleksem.klf.uw.edu.pl/~jsbien/

