pupil's comment: Are Latin and Cyrillic essentially the same script?

JP Blankert (thuis & PC based) Thu, 18 Nov 2010 17:22:36 -0800

Dear all,

Still see myself as pupil reading introduction chart of unicode, but Iam happy to join the discussion on the Russian: it is quite differentfrom Latin. Apart from 33 characters in Russian alphabet = morecharacters and apart from quite a few characters that as English speakeryou clearly do not know, Latin and Russian indeed contain some similarcharacters. But watch out. There are if I am correct 3 a's in the world,in this email a (Latin) looks like a (Russian) but they are different.So the Russian a is quite suited for a hierogplyph attack (I will tryontslag.com, which is Dutch for dismissal.com, to see how search enginesreact. With Russian a. Punycode is different of the word as total).

Similar example: Ukraine i - looks like ours, but you can't register iton .rf (Russian Federation).

Experiment 1 year ago with *Reïntegratie.com*<http://www.google.nl/aclk?sa=l&ai=Cq32OAcrlTIelNsGTOoCQ8Z4GwoKpugHavNrYFpf09AgIABADKANQppe9lfj_____AWCRvJqFhBigAaryw_4DyAEBqQJLcsn7dNi2PqoEHE_QPDrLX54nLEfeere4hVxwC4D9yTrI81AEiP26BRMI9ayF7dSrpQIVyo0OCh1WKGKjygUA&ei=AcrlTLWoLsqbOtbQiJsK&sig=AGiWqtxaX45Uf8wTKRjRJAdJsIX8fkSunA&adurl=http://www.arboned.nl/diensten/arbeidsdeskundig-advies/dienst/arbeidsdeskundig-reintegratieonderzoek/>being correct Dutch for reintegration, but being impossible asdomainname because SIDN.nl (supposed to be nic.nl) is very conservativeand does not even allow signs gave as result: in the beginning Googleappreciated and appreciated it....after a few months the hosted andfilled site 'sank'.(I borrowed the **ï*<http://www.google.nl/aclk?sa=l&ai=Cq32OAcrlTIelNsGTOoCQ8Z4GwoKpugHavNrYFpf09AgIABADKANQppe9lfj_____AWCRvJqFhBigAaryw_4DyAEBqQJLcsn7dNi2PqoEHE_QPDrLX54nLEfeere4hVxwC4D9yTrI81AEiP26BRMI9ayF7dSrpQIVyo0OCh1WKGKjygUA&ei=AcrlTLWoLsqbOtbQiJsK&sig=AGiWqtxaX45Uf8wTKRjRJAdJsIX8fkSunA&adurl=http://www.arboned.nl/diensten/arbeidsdeskundig-advies/dienst/arbeidsdeskundig-reintegratieonderzoek/>*from Catalan, amidst Latin characters).

News about ss / sz to whom is interested: most Germans were alert(ss-holders had priority to /ß)//, /so no/Fußbal/l for me, but onlyexperimental domain names IDNexpress.de and IDNexpre/ß.de. /It was amini-landrush on Nov. 16 2010, 10:00 German time onwards (Denic.de)

/Very busy with .rf auction now, in December I will put 2 differentsites on these ss and sz names so people can wonder at their screens tosee what is happening.

Above reaction was more out of domain names and practical experiencethan chartUTFxyz - but definitely: different script.


Br,

Philippe


On 18-11-2010 20:04, Asmus Freytag wrote:

On 11/18/2010 8:04 AM, Peter Constable wrote:
From: [email protected] [mailto:[email protected]]On Behalf Of André Szabolcs Szelp
AFAIR the reservations of WG2 concerning the encoding of Jangalif
Latin Ь/ь as a new character were not in view of Cyrillic Ь/ь, but
rather in view of its potential identity with the tone sign mentioned
by you as well. It is a Latin letter adapted from the Cyrillic softsign,
There's another possible point of view: that it's a Cyrilliccharacter that, for a short period, people tried using as a Latincharacter but that never stuck, and that it's completely adequate torepresent Janalif text in that orthography using the Cyrillic soft sign.
When one language borrows a word from another, there are severalstages of "foreignness", ranging from treating the foreign word as ashort quotation in the original language to treating it as essentiallyfully native.
Now words are very complex in behavior and usage compared tocharacters. You can check for pronunciation, spelling and adaptationto the host grammar to check which stage of adaptation a word hasreached.
When a script borrows a letter from another, you are essentiallylimited in what evidence you can use to document objectively whetherthe borrowing has crossed over the script boundary and the characterhas become "native".
With typographically closely related scripts, getting tell-taletypographical evidence is very difficult. After all, these scriptsstarted out from the same root.
So, you need some other criteria.
You could individually compare orthographies and decide which ones are"important" enough (or "established" enough) to warrant support. Oryou could try to distinguish between orthographies for general usewithing the given language, vs. other systems of writing(transcriptions, say).
But whatever you do, you should be consistent and take account ofexisting precedent.
There are a number of characters encoded as nominally "Latin" inUnicode that are borrowings from other scripts, usually Greek.
A discussion of the current issue should include explicit explanationof why these precedents apply or do not apply, and, in the lattercase, why some precedents may be regarded as examples of past mistakes.
By explicitly analyzing existing precedents, it should be possible toavoid the impression that the current discussion is focused on therelative merits of a particular orthography based on personal andpossibly arbitrary opinions by the work group experts.
If it can be shown that all other cases where such borrowings wereaccepted into Unicode are based on orthographies that are morepermanent, more widespread or both, or where other technical ortypographical reasons prevailed that are absent here, then it wouldmake any decision on the current request seem a lot less arbitrary.
I don't know where the right answer lies in the case of Janalif, orwhich point of view, in Peter's phrasing, would make the most sense,but having this discussion without clear understanding of theprecedents will lead to inconsistent encoding.
A./



Geen virus gevonden in het binnenkomende-bericht.
Gecontroleerd door AVG - www.avg.com
Versie: 9.0.869 / Virusdatabase: 271.1.1/3264 - datum van uitgifte: 11/18/10 
08:37:00

pupil's comment: Are Latin and Cyrillic essentially the same script?

Reply via email to