Hi Aditya,
--- Aditya Gokhale [EMAIL PROTECTED] wrote:
I had few query regarding representation of Devanagari script in
Unicode
(Code page - 0x0900 - 0x097F). Devanagari is a writing script, is used in
Hindi, Marathi and Sanskrit languages. I have following questions -
In the same
Hi,
Forgot to reply implementation query. The reply is inline.
--- Aditya Gokhale [EMAIL PROTECTED] wrote:
2. Implementation Query -
In an implementation where I need to send / process Hindi, Marathi
and Sanskrit data, how do I differentiate between languages (Hindi,
Marathi and
Hello,
Thanks for the reply. I will check the points as you said, as far as the
font issues are considered. We all know how jna,shra and ksh are formed in
UNICODE and ISCII, but the point I wanted to make was, if we have to sort /
search / process the data in Devanagari script, then we have
Aditya Gokhale wrote:
Hello Everybody,
I had few query regarding representation of Devanagari
script in Unicode
All your questions are FAQ's, so I'll just reference the entries which
answers them.
(Code page - 0x0900 - 0x097F). Devanagari is a writing
script, is used in Hindi, Marathi
--- Asmus Freytag [EMAIL PROTECTED] wrote:
All of the above can be composed through following consonant clusters:
jna - ja halant nya
shra - sha halant ra
ksh - ka halant ssha
The point that the above sequences are considered as characters in some
of
the Indian languages has
Hello,
There are few discrepancies in Indic FAQ. Though it was reported earlier by
Andy White, I see they still have place there in the FAQ. I also clarified
it but by mistake I sent the mail to Yahoo groups where this mailing list
is archived and hence my mail never reached to this mailing list.
Keyur Shroff scripsit:
Sentiments are attached with cultures which may vary from one geographical
area to another. So when one of the many languages falling under the same
script dominate the entire encoding for the script, then other group of
people may feel that their language has not been
At 02:13 -0800 2003-01-29, Keyur Shroff wrote:
I beg to differ with you on this point. Merely having some provision for
composing a character doesn't mean that the character is not a candidate
for inclusion as separate code point.
Yes, it does.
India is a big country with millions of people
Keyur Shroff wrote:
In the FAQ
http://www.unicode.org/faq/indic.html#16
It is mentioned that following are equivalent
ISCII Unicode
KA halant INV KA virama ZWJ
RA halant INV RAsup (i.e., repha)
The last line is really bizarre! I would agree that it is
The [new] INV character in Unicode can also be used for displaying dependent
vowel matras without dotted circle.
A space followed by a dependent vowel sign should display just the
dependent vowel sign, no dotted circle. Indeed, (except for a show
invisibles mode, or a character chart display
--- Marco Cimarosti [EMAIL PROTECTED] wrote:
Why not representing INV with a double ZWJ? E.g.:
ISCII Unicode
KA halant INV KA virama ZWJ ZWJ
RA halant INV RA virama ZWJ ZWJ (i.e., repha)
INV halant RA ZWJ ZWJ virama RA
I wouldn't go so far. The fact that clusters belong together is something
that can be handled by the software. Collation and other data processing
needs to deal with such issues already for many other languages. See
http://www.unicode.org/reports/tr10 on the collation algorithm.
I
--- Kent Karlsson [EMAIL PROTECTED] wrote:
A space followed by a dependent vowel sign should display just the
dependent vowel sign, no dotted circle. Indeed, (except for a show
invisibles mode, or a character chart display mode) no (Indic or
other)
text that does not contain the
Keyur Shroff wrote:
But sometimes a user may want visual representation of these
symbols in two different ways: with dotted circle and
without dotted circle.
Why not using a dotted circle character explicity, when you want to see one?
Example of
this could be RAsup on top of dotted circle
Michael Everson wrote:
At 02:13 -0800 2003-01-29, Keyur Shroff wrote:
I beg to differ with you on this point. Merely having some provision for
composing a character doesn't mean that the character is not a candidate
for inclusion as separate code point.
Yes, it does.
India is a big
Aditya Gokhale wrote:
1. In Marathi and Sanskrit language two characters glyphs of
'la' and 'sha' are represented differently as shown in the
image below -
Actually, for everyone's information: these allographs for Marathi were
recently brought to our attention, and Unicode 4.0 will have a
Keyur Shroff wrote
Kent Karlsson [EMAIL PROTECTED] wrote:
A space followed by a dependent vowel sign should display just the
dependent vowel sign, no dotted circle. Indeed, (except for a show
invisibles mode, or a character chart display mode) no (Indic or
other)
text that does not
Christopher John Fynn wrote:
I had thought that the argument for including KSSA as a seperate
character in the Tibetan block (rather than only having U+0F40 and
U+0FB5) was originally for compatibility / cross mapping with
Devanagari and other Indic scripts.
Which is not a valid reason
--- Marco Cimarosti [EMAIL PROTECTED] wrote:
Keyur Shroff wrote:
But sometimes a user may want visual representation of these
symbols in two different ways: with dotted circle and
without dotted circle.
Why not using a dotted circle character explicity, when you want to see
one?
Note
http://archive.devx.com/free/tips/tipview.asp?content_id=4151
Who knew in this day and age flipping bits to change case is still publishable (this
is from today!)
Barry Caplan
www.i18n.com
Vendor Showcase: http://Showcase.i18n.com
--
20 matches
Mail list logo