Michael Everson [EMAIL PROTECTED] writes:
It is common enough. It is more common in Sweden than it is in Germany.
I can't compare with Germany, but I wouldn't say that it's common.
I could think of it as a gimmick, but I would be inclined to say that
it is more common to use cyrillic letter
Michael Everson had written:
It was more common in Germany, Sweden, and Estonia earlier this century than
it is today.
On 2001-01-17 at 09:22 h UCT, Erland Sommarskog wrote:
You mean that were was a fad last year? I have to confess that I missed
it.
You mean, this very month? (Rather than
Ar 13:50 -0800 2001-01-16, scrobh [EMAIL PROTECTED]:
In the better known Indic scripts, are there ever cases of conjuncts formed
with independent vowels and a following consonant?
Not in the better-known ones, except possibly in esoteric manuscripts. One
finds weird stacking behaviour in Tibetan
Michael Everson wrote:
Ar 13:50 -0800 2001-01-16, scrobh [EMAIL PROTECTED]:
Now, suppose a VC conjunct were to occur, as described above; "al", for
example. Would it seem preferable to treat the vowel like a consonant, and
encode as
A + virama + L
or to treat the consonant, and
I am investigating using the Unicode standard to store and forward
Chinese characters in a mainframe (IMS) environment.
Basically we want to receive Chinese into the system, encode into
UNICODE, send it to the mainframe and store on the IMSDB. At a later
stage, then decode back into Chinese for
Unicode is always serialized in a UTF: UTF-8, UTF-16*, or UTF-32*. The
definition of each of these is invariant across systems: in UTF-8 an 'a' is
always stored as 0x61. There is a special UTF for use on EBCDIC systems.
Check out the technical reports and FAQs on www.unicode.org.
Mark
-
Within the IMS database, any form of data can be stored. Beware, however,
that certain parameters, such as the transaction name, must always be in
EBCDIC. While the database itself can handle Unicode in any format, you
have to be careful about how you work with that data - the IMS Transaction
On 01/17/2001 06:05:15 AM Antoine Leca wrote:
Of course, in regular Nagari, one ought to encode A +
virama + La/0932 (+ virama if followed by a consonant or at end of the
word
in Sanskrit), as this is the way it is written.
This is actually done? I got the impression from reading chapter 9 in
On Wed, 17 Jan 2001 [EMAIL PROTECTED] wrote:
On 01/17/2001 05:13:25 AM Michael Everson wrote:
A + Ldep
No such thing as Ldep in our model, so you'd have to rely on A + virama +
L.
Well, if a script had such behaviour, one possibility could be to propose a
combining CONSONANT
On Tue, 16 Jan 2001, Mark Davis wrote:
Doug Felt here confirmed that this is a bug in the implementation section.
While it does not affect the conformance of the main algorithm, it would
affect people trying to use that optimization strategy. (we here don't use
that strategy, by the way).
In Bengali Vowel_A can form a conjunct with letter_Ya (Ya taking its zophola form.)
It has been suggested that this should be encoded as Vowel_A ZWJ Ya
I believe that the series V ZWJ C is much more logical than V Virama C as the
semantics of virama are to suppress the vowel.
Abdul
On 01/17/2001 02:52:41 PM John Hudson wrote:
Are thes four consonants always joined in this way when following an
independent vowel? Or is this behaviour exceptional and limited to
borrowed
words, etc.?
My understanding is the latter. Thus, I don't think obligatory ligation
would work.
-
On 01/17/2001 03:10:22 PM "AbdulMalik" wrote:
In Bengali Vowel_A can form a conjunct with letter_Ya (Ya taking its
zophola
form.)
It has been suggested that this should be encoded as Vowel_A ZWJ Ya
I believe that the series V ZWJ C is much more logical than V Virama C as
the
semantics of
Yes, I have already proposed an agenda item for the next UTC, to get this
fix into 3.1.
Mark
___
Mark Davis, IBM GCoC, Cupertino
(408) 777-5850 [fax: 5891], [EMAIL PROTECTED], [EMAIL PROTECTED]
http://maps.yahoo.com/py/maps.py?Pyt=Tmapaddr=10275+N.+De+Anzacsz=95014
Roozbeh Pournader [EMAIL
Hi everyone,
I'm preparing some mappings of teletext character sets to Unicode. You can
see my results so far at
http://www.sneezes.freeserve.co.uk/teletext/tech/charenc/teletextcharencs.ht
ml
[hope that URL doesn't get split..] This is a LARGE page, btw (150k). In
IE5+, hover over the
Proposed Draft Unicode Technical Report #27: Unicode 3.1 is now
available at
http://www.unicode.org/unicode/reports/tr27/
Please take a look at it and report any problems you may find. It is
approximately 60 pages long.
Julie Allen
Editor, Unicode, Inc.
16 matches
Mail list logo