.
William Overington wrote,
> I find it strange that the Unicode Standard does not codify the
> ligatures which can be produced with the languages of the Indian
> subcontinent at display time using specific sequences of regular
> Unicode characters so that someone skilled in the art of font des
The Unicode "Public Review Issues" page has been updated today.
Highlights:
Closed issue #1 (Language tag deprecation) without any change.
Updated some deadlines on other issues to June 1, 2003.
Added a document for issue #7 (tailored normalizations).
Added an issue #8 regarding p
At 16:08 + 2003-03-14, William Overington wrote:
I find it strange that the Unicode Standard does not codify the
ligatures which can be produced with the languages of the Indian
subcontinent at display time using specific sequences of regular
Unicode characters so that someone skilled in th
> Otto Stolz wrote:
>
> >
> > The two scans under
> > http://www.rz.uni-konstanz.de/Antivirus/tests/li.png
> > http://www.rz.uni-konstanz.de/Antivirus/tests/re.png
> > are from the authoritative (until July 1996) book on German
> > orthography: Duden "Rechtschreibung der deutschen Sprache
> >
Which pinyin system the "rua" is in?
I use simpchinese win XP and if I switch to Full Spell (??)Simplified
Chinese IME and type "rua', then I got "挼" (read this email in UTF-8)
which is U+633C
I am not sure that is correct. At least, as a native Mardarin speaker,
that sound is not nature for me
William Overington asked:
> I wonder if you could please say whether the Unicode 4.0 book will have the
> same chapter headings and numbering as the Unicode 3.0 book?
They will be largely similar -- and identical for Chapters 1 through
5 -- but there are various reorganizations in the latter part
Folks,
Getting down to the wire, time to get your registration in. Join us in
Prague!
Lisa
*
Register now! > Just 1 week to go! > Register now! > Just 1 week to go!
Otto Stolz wrote:
The two scans under
http://www.rz.uni-konstanz.de/Antivirus/tests/li.png
http://www.rz.uni-konstanz.de/Antivirus/tests/re.png
are from the authoritative (until July 1996) book on German
orthography: Duden "Rechtschreibung der deutschen Sprache
und der Fremdwörter" / hrsg. v
I think that is a hard problem
First of all. Take a look at
http://www.unicode.org/Public/4.0-Update/UCD-4.0.0d5b.html
and find the one
Second, anything which need to be Symmetric Swap in Bidi probably need to
be change in the vertical form. (If they need to be change in horizontal
directio
At 08:08 AM 3/14/2003, William Overington wrote:
I find it strange that the
Unicode Standard does not codify the ligatures which can be produced with
the languages of the Indian subcontinent at display time using specific
sequences of regular Unicode characters so that someone skilled in the art
o
The other point is that the images need to be sufficient to show that they
are photocopies of real documents, with citations for the sources.
Mark
[EMAIL PROTECTED]
IBM, MS 50-2/B11, 5600 Cottle Rd, SJ CA 95193
(408) 256-3148
fax: (408) 256-0799
- Original Message -
From: "Rick M
Nooo - Java's old "UTF" functions do not process UTF-8! They are there for String serialization, a
Java-internal format.
Use the Java Reader/Writer classes instead of these old ones!
See the Java tutorials on Internationalization:
http://java.sun.com/docs/books/tutorial/i18n/text/convertintro.htm
askq1 askq1 wrote:
> >From: "Pim Blokland" <[EMAIL PROTECTED]>
>
> >However, you have said this is not what you want!
> >So what is it that you do want?
>
> I want c/c++ code that will give me UTF8 byte sequence
> representing a given code-point,
> UTF16 16 bits sequence reppresenting a given
>
Let's try this:
ICU has C header files with macros for code point handling in UTF-8/16 strings. See the utf8.h and
utf16.h headers (together with utf.h) in ICU's source tree at source/common/unicode/.
http://oss.software.ibm.com/icu/download/
http://oss.software.ibm.com/cvs/icu/icu/source/common
If you need a utility to do these conversions for you right away, take a
look at "uniconv" which is part of Gaspar Sinai's Yudit unicode editor.
This is an Open Source program, so you can look at the code too:
http://www.yudit.org
For C++ and Java libraries, check out IBM's International Compone
Otto Stolz pointed to:
http://www.rz.uni-konstanz.de/Antivirus/tests/li.png
http://www.rz.uni-konstanz.de/Antivirus/tests/re.png
Yes, some of those are worth proposing, I would say.
The in-line textual usage is pretty well given by the two images. But
someone needs to investigate the mean
On Fri, 14 Mar 2003, Werner LEMBERG wrote:
> Some lists of pinyin syllables contain `rua', but I actually can't
> find any Chinese character with this name.
Not all words or utterances necessarily have written forms, but...
> Does it exist at all? Or is it just there for completeness of pinyi
On Fri, 14 Mar 2003 08:43:19 -0800 (PST), Werner LEMBERG wrote:
> Some lists of pinyin syllables contain `rua', but I actually can't
> find any Chinese character with this name.
>
> Does it exist at all? Or is it just there for completeness of pinyin?
This is not really a Unicode question, and
askq1 askq1 wrote:
Actually my requirement is striaght-forward/common and I believe it
should be available somewhere on net.
In particular I need source code (or some way) for following requirements:
- Convert Unicode code-point to UTF8 encoding and vice-versa.
- Convert Unicode code-point to UCS
Markus Scherer wrote as follows.
quote
It has been suggested many times to build a database (list, document, XML,
...) where each designated/assigned code point and each character gets its
"story": Comments on the glyphs, from what codepage it was inherited, usage
comments and examples, alternate
From: "Pim Blokland" <[EMAIL PROTECTED]>
However, you have said this is not what you want!
So what is it that you do want?
I want c/c++ code that will give me UTF8 byte sequence representing a given
code-point, UTF16 16 bits sequence reppresenting a given code-point, UTF32
32 bits sequence repre
Some lists of pinyin syllables contain `rua', but I actually can't
find any Chinese character with this name.
Does it exist at all? Or is it just there for completeness of pinyin?
Werner
askq1 askq1 schreef:
> Character U+4321 is the unicode code-point but to store this
character into
> a file we need to use a certain encoding format.
Yes. That depends on the implemention. If your character is kept in
memory as a 16 bits type, that's simply an short integer with the
hex value 0x4
> It has been suggested many times to build a database (list,
> document, XML, ...) where each
> designated/assigned code point and each character gets its
> "story": Comments on the glyphs, from
> what codepage it was inherited, usage comments and examples,
> alternate names, etc.
>
> I am t
From: "Pim Blokland" <[EMAIL PROTECTED]>
To: "Unicode mailing list" <[EMAIL PROTECTED]>
Subject: Re: Need encoding conversion routines
Date: Fri, 14 Mar 2003 12:30:44 +0100
askq1 askq1 schreef:
> In particular I need source code (or some way) for following
requirements:
> - Convert Unicode code-po
askq1 askq1 schreef:
> In particular I need source code (or some way) for following
requirements:
> - Convert Unicode code-point to UTF8 encoding and vice-versa.
> - Convert Unicode code-point to UCS2 encoding and vice-versa.
> - Convert Unicode code-point to UTF16 encoding and vice-versa.
Ahem.
.
Pim Blokland wrote in response to "K.",
> > another. (UTF8 to UTF16 etc.) But how can I convert from a
> > unicode code-point to some encoding and decode from some
> > encoding to unicode code-point? In brief, I want encoding
> > and decoding functions for Unicode.
>
> What is "some encoding"?
.
C.T.M. Jacobs wrote,
> And yet Tengwar seems to redefine the ASCII range ...
>
> http://www.gis.net/~dansmith/fonts/tengwar.htm
>
Not Tengwar itself, of course, but rather the various font developers
who have made these "custom-encoded" fonts.
The less charitable among us call such fonts "h
From: "Pim Blokland" <[EMAIL PROTECTED]>
What is "some encoding"? You mean codepage-based 8-bit character
sets?
By encoding I mean UTF8, UTF16, UTF32, UCS2 etc.
Actually my requirement is striaght-forward/common and I believe it should
be available somewhere on net.
In particular I need source co
At 10:01 +0100 2003-03-14, Otto Stolz wrote:
Dominikus Scherkl had written:
Has anybody meanwhile contributed a proposal regarding the missing
genealogical symbols? (after Otto Stolz's message from 24.02.2003 I
wondered this was not proposed long ago - or was it and was
rejected for some reason?
- Original Message -
From: "Doug Ewell" <[EMAIL PROTECTED]>
To: "Unicode Mailing List" <[EMAIL PROTECTED]>
Cc: "Ruddy, James" <[EMAIL PROTECTED]>; "John Hudson" <[EMAIL PROTECTED]>
Sent: Thursday, March 13, 2003 11:09 PM
Subject: Re: Tolkien wanta-be has created entirely new language
[ .
askq1 askq1 schreef:
> another. (UTF8 to UTF16 etc.) But how can I convert from a
> unicode code-point to some encoding and decode from some
> encoding to unicode code-point? In brief, I want encoding
> and decoding functions for Unicode.
What is "some encoding"? You mean codepage-based 8-bit cha
I wonder if you could please say whether the Unicode 4.0 book will have the
same chapter headings and numbering as the Unicode 3.0 book?
My reason for asking is that I am writing a paper about the possible
problems with using languages of the Indian subcontinent on the DVB-MHP
(Digital Video Broad
Yesterday, 13 March 2003, I wrote as follows.
quote
So I reasoned that the system might scan through a font when it is loaded
and decide upon the lowest point for the whole font and then proceed on that
basis.
end quote
An email correspondent has kindly written to me privately and I now know
th
Dominikus Scherkl had written:
> Has anybody meanwhile contributed a proposal
> regarding the missing genealogical symbols ?
> (after Otto Stolz's message from 24.02.2003 I wondered this
> was not proposed long ago - or was it and was rejected for
> some reason?!?).
So do I -- however, I am not in
I have tried it on Win98, WinME, Win2000, WinXP, and Windows Server
2003. LCMapStringA/CompareStringA on all five and
LCMapStringW/CompareStringW on the NT-ish platforms.
I do not and cannot repro the reported problem of your colleague.
Please have that colleague send me some email with their rep
36 matches
Mail list logo