abstract characters, semantics, meaningful transformations ... Was: Tibetan Paluta

Naena Guru via Unicode Mon, 01 May 2017 07:26:13 -0700

This whole attempt to make digitizing Indic script some esoteric,'abstract', 'semantic representation' and so on seems to me is anattempt to make Unicode the realm of the some super humans.

The purpose of writing is to represent speech. It is not some secretthat demi-gods created that we are trying to explain with 'modern'linguistic gymnastics. sound => letter that is the basis for writing.English writing was massacred when printing was brought in from Europe.A similar thing is happening to Indic by all this mumbo-jumbo.

I call out to NATIVE users of Indic to explain what apparently Europeansor Americans are discussing here.



On 5/1/2017 10:47 AM, Philippe Verdy wrote:

2017-04-29 21:21 GMT+02:00 Naena Guru via Unicode <unicode@unicode.org<mailto:unicode@unicode.org>>:
    Just about the name paluta:
    In Sanskrit, the length of vowels are measured in maaþra (a
    cognate of the word 'meter'). It is the spoken length of a short
    vowel. In Latin it is termed mora. Usually, you have only single
    and double length vowels. A paluþa length is like when you call
    out somebody from a distance. Pluta is a careless use of spelling.
    Virama and Halanta are two other terms loosely used.

    Anyway, Unicode is only about DISPLAYING a script: There's a shape
    here; Let's find how to get it by assembling other shapes or by
    creating a code point for it. What is short, long or longer in
    speech is no concern for Unicode.
Wrong. Unicode is absolutely not about how to "display" any script(except symbols and notational symbols). Unicode does not encodeglyphs. Unicode encodes "abstract characters" according to theirsemantics, in order to assign them properties allowing meaningfultransformations of text and in order to allow perfoirming searches(with collation algorithms). What is important is their properties(something that ISO 10646 does not care when it started the UCS in aseparate project, ignoring how it would be used, focusing too much onapparent glytphs (and introducing lot of "compatiblity characters"that would not have been encoded otherwise, and creating some havoc inlogical processing.
Anyway Unciode makes some exceptions to the logical model only forroundtrip comptaibility with other standards that used anotherencoding model widely used, notably in Thai: these are the exceptionwhere there are "prepended" letters. There was some havoc also forsome scripts in India because of roundtrip compatiblity with an Indianstandard (criticized by many users of Tamil and some other SouthernIndic scripts that don't follow directly the paradigm created forgetting some limited transliteration with Devanagari: that initialdesire was abandoned but the legacy Indic scripts in India wereimported as is to Unicode)

abstract characters, semantics, meaningful transformations ... Was: Tibetan Paluta

Reply via email to