Advice on how to approach character translation

R (Chandra) Chandrasekhar Wed, 23 Apr 2008 02:34:25 -0700

Dear Folks,

A scheme called ITRANS uses sequences of between one and three printable ASCII characters to unambiguously represent characters in Indic scripts or in a Romanized script called IAST. Since the characters in these scripts have Unicode code points, it should be possible to automate the translation of words in the ASCII source text into the desired Unicode output text.

I am trying to write a Perl script to do this and would appreciate advice on how best to proceed before I start.

To give a better picture of what I am trying to do, I have given some examples below for ASCII to IAST characters:


--------

1. Between one and three ASCII printing characters are transliterated to one Unicode character.


2. Many characters are unchanged by the transliteration.

3. Some transliteration examples are shown below:

a       a   U+0061   LATIN SMALL LETTER A
aa      ā   U+0101   LATIN SMALL LETTER A WITH MACRON
A       ā   U+0101   LATIN SMALL LETTER A WITH MACRON
.a      '   U+0027   APOSTROPHE
~N      ṅ   U+1E45   LATIN SMALL LETTER N WITH DOT ABOVE
RRI     ṝ   U+1E5D   LATIN SMALL LETTER R WITH DOT BELOW AND MACRON
R^I     ṝ   U+1E5D   LATIN SMALL LETTER R WITH DOT BELOW AND MACRON
--------
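To make the question concrete, here is a minimal sketch of one possible approach: a lookup hash combined with a single regex alternation that tries the longest ASCII sequences first, so that "aa" is matched before "a". The hash %itrans_to_iast and the sub itrans_to_iast() are hypothetical names chosen for illustration, and the table holds only the seven examples listed above, not the full ITRANS scheme. Any character that is not a key in the table passes through unchanged.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical partial table: only the example mappings listed above.
my %itrans_to_iast = (
    'a'   => 'a',
    'aa'  => "\x{0101}",    # LATIN SMALL LETTER A WITH MACRON
    'A'   => "\x{0101}",
    '.a'  => "'",           # APOSTROPHE
    '~N'  => "\x{1E45}",    # LATIN SMALL LETTER N WITH DOT ABOVE
    'RRI' => "\x{1E5D}",    # LATIN SMALL LETTER R WITH DOT BELOW AND MACRON
    'R^I' => "\x{1E5D}",
);

# Build one alternation with the longest keys first, so that a
# three-character sequence is tried before any shorter one.
# quotemeta escapes regex metacharacters such as '.', '^' and '~'.
my $alternation = join '|',
    map  { quotemeta }
    sort { length($b) <=> length($a) || $a cmp $b }
    keys %itrans_to_iast;
my $token_re = qr/($alternation)/;

# Replace every recognised ASCII sequence; everything else is left as is.
sub itrans_to_iast {
    my ($text) = @_;
    $text =~ s/$token_re/$itrans_to_iast{$1}/g;
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print itrans_to_iast('raama'), "\n";    # prints "rāma"
```

Whether a single substitution pass like this is adequate for the complete scheme, or whether a proper tokenizer is needed, is exactly the kind of advice I am looking for.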

Many thanks.

Chandra

