Re: per-character "stories" in a database

2003-03-14 Thread jameskass
. William Overington wrote, > I find it strange that the Unicode Standard does not codify the > ligatures which can be produced with the languages of the Indian > subcontinent at display time using specific sequences of regular > Unicode characters so that someone skilled in the art of font des

Unicode Public Review Issues update

2003-03-14 Thread Rick McGowan
The Unicode "Public Review Issues" page has been updated today. Highlights: Closed issue #1 (Language tag deprecation) without any change. Updated some deadlines on other issues to June 1, 2003. Added a document for issue #7 (tailored normalizations). Added an issue #8 regarding p

Re: per-character "stories" in a database

2003-03-14 Thread Michael Everson
At 16:08 + 2003-03-14, William Overington wrote: I find it strange that the Unicode Standard does not codify the ligatures which can be produced with the languages of the Indian subcontinent at display time using specific sequences of regular Unicode characters so that someone skilled in th

Re: New document.

2003-03-14 Thread Kenneth Whistler
> Otto Stolz wrote: > > > > > The two scans under > > http://www.rz.uni-konstanz.de/Antivirus/tests/li.png > > http://www.rz.uni-konstanz.de/Antivirus/tests/re.png > > are from the authoritative (until July 1996) book on German > > orthography: Duden "Rechtschreibung der deutschen Sprache > >

Re: pinyin syllable `rua'

2003-03-14 Thread Yung-Fong Tang
Which pinyin system the "rua" is in? I use simpchinese win XP and if I switch to Full Spell (??)Simplified Chinese IME and type "rua', then I got "挼" (read this email in UTF-8) which is U+633C I am not sure that is correct. At least, as a native Mardarin speaker, that sound is not nature for me

Re: Unicode 4.0 chapter headings and numbering.

2003-03-14 Thread Kenneth Whistler
William Overington asked: > I wonder if you could please say whether the Unicode 4.0 book will have the > same chapter headings and numbering as the Unicode 3.0 book? They will be largely similar -- and identical for Chapters 1 through 5 -- but there are various reorganizations in the latter part

Unicode Conference Reminder

2003-03-14 Thread Lisa Moore
Folks, Getting down to the wire, time to get your registration in. Join us in Prague! Lisa * Register now! > Just 1 week to go! > Register now! > Just 1 week to go!

Re: New document.

2003-03-14 Thread Yung-Fong Tang
Otto Stolz wrote: The two scans under http://www.rz.uni-konstanz.de/Antivirus/tests/li.png http://www.rz.uni-konstanz.de/Antivirus/tests/re.png are from the authoritative (until July 1996) book on German orthography: Duden "Rechtschreibung der deutschen Sprache und der Fremdwörter" / hrsg. v

Re: Characters that rotate in vertical text

2003-03-14 Thread Yung-Fong Tang
I think that is a hard problem First of all. Take a look at http://www.unicode.org/Public/4.0-Update/UCD-4.0.0d5b.html and find the one Second, anything which need to be Symmetric Swap in Bidi probably need to be change in the vertical form. (If they need to be change in horizontal directio

Re: per-character "stories" in a database (derives from Re: geometric shapes)

2003-03-14 Thread John Hudson
At 08:08 AM 3/14/2003, William Overington wrote: I find it strange that the Unicode Standard does not codify the ligatures which can be produced with the languages of the Indian subcontinent at display time using specific sequences of regular Unicode characters so that someone skilled in the art o

Re: New document.

2003-03-14 Thread Mark Davis
The other point is that the images need to be sufficient to show that they are photocopies of real documents, with citations for the sources. Mark [EMAIL PROTECTED] IBM, MS 50-2/B11, 5600 Cottle Rd, SJ CA 95193 (408) 256-3148 fax: (408) 256-0799 - Original Message - From: "Rick M

Re: Unicode character transformation through XSLT

2003-03-14 Thread Markus Scherer
Nooo - Java's old "UTF" functions do not process UTF-8! They are there for String serialization, a Java-internal format. Use the Java Reader/Writer classes instead of these old ones! See the Java tutorials on Internationalization: http://java.sun.com/docs/books/tutorial/i18n/text/convertintro.htm

RE: Need encoding conversion routines

2003-03-14 Thread Marco Cimarosti
askq1 askq1 wrote: > >From: "Pim Blokland" <[EMAIL PROTECTED]> > > >However, you have said this is not what you want! > >So what is it that you do want? > > I want c/c++ code that will give me UTF8 byte sequence > representing a given code-point, > UTF16 16 bits sequence reppresenting a given >

Re: Need encoding conversion routines

2003-03-14 Thread Markus Scherer
Let's try this: ICU has C header files with macros for code point handling in UTF-8/16 strings. See the utf8.h and utf16.h headers (together with utf.h) in ICU's source tree at source/common/unicode/. http://oss.software.ibm.com/icu/download/ http://oss.software.ibm.com/cvs/icu/icu/source/common

Re: Need encoding conversion routines

2003-03-14 Thread Edward H Trager
If you need a utility to do these conversions for you right away, take a look at "uniconv" which is part of Gaspar Sinai's Yudit unicode editor. This is an Open Source program, so you can look at the code too: http://www.yudit.org For C++ and Java libraries, check out IBM's International Compone

Re: New document.

2003-03-14 Thread Rick McGowan
Otto Stolz pointed to: http://www.rz.uni-konstanz.de/Antivirus/tests/li.png http://www.rz.uni-konstanz.de/Antivirus/tests/re.png Yes, some of those are worth proposing, I would say. The in-line textual usage is pretty well given by the two images. But someone needs to investigate the mean

Re: pinyin syllable `rua'

2003-03-14 Thread Thomas Chan
On Fri, 14 Mar 2003, Werner LEMBERG wrote: > Some lists of pinyin syllables contain `rua', but I actually can't > find any Chinese character with this name. Not all words or utterances necessarily have written forms, but... > Does it exist at all? Or is it just there for completeness of pinyi

Re: pinyin syllable `rua'

2003-03-14 Thread Andrew C. West
On Fri, 14 Mar 2003 08:43:19 -0800 (PST), Werner LEMBERG wrote: > Some lists of pinyin syllables contain `rua', but I actually can't > find any Chinese character with this name. > > Does it exist at all? Or is it just there for completeness of pinyin? This is not really a Unicode question, and

Re: Need encoding conversion routines

2003-03-14 Thread Otto Stolz
askq1 askq1 wrote: Actually my requirement is striaght-forward/common and I believe it should be available somewhere on net. In particular I need source code (or some way) for following requirements: - Convert Unicode code-point to UTF8 encoding and vice-versa. - Convert Unicode code-point to UCS

Re: per-character "stories" in a database (derives from Re: geometric shapes)

2003-03-14 Thread William Overington
Markus Scherer wrote as follows. quote It has been suggested many times to build a database (list, document, XML, ...) where each designated/assigned code point and each character gets its "story": Comments on the glyphs, from what codepage it was inherited, usage comments and examples, alternate

Re: Need encoding conversion routines

2003-03-14 Thread askq1 askq1
From: "Pim Blokland" <[EMAIL PROTECTED]> However, you have said this is not what you want! So what is it that you do want? I want c/c++ code that will give me UTF8 byte sequence representing a given code-point, UTF16 16 bits sequence reppresenting a given code-point, UTF32 32 bits sequence repre

pinyin syllable `rua'

2003-03-14 Thread Werner LEMBERG
Some lists of pinyin syllables contain `rua', but I actually can't find any Chinese character with this name. Does it exist at all? Or is it just there for completeness of pinyin? Werner

Re: Need encoding conversion routines

2003-03-14 Thread Pim Blokland
askq1 askq1 schreef: > Character U+4321 is the unicode code-point but to store this character into > a file we need to use a certain encoding format. Yes. That depends on the implemention. If your character is kept in memory as a 16 bits type, that's simply an short integer with the hex value 0x4

RE: per-character "stories" in a database

2003-03-14 Thread Dominikus Scherkl
> It has been suggested many times to build a database (list, > document, XML, ...) where each > designated/assigned code point and each character gets its > "story": Comments on the glyphs, from > what codepage it was inherited, usage comments and examples, > alternate names, etc. > > I am t

Re: Need encoding conversion routines

2003-03-14 Thread askq1 askq1
From: "Pim Blokland" <[EMAIL PROTECTED]> To: "Unicode mailing list" <[EMAIL PROTECTED]> Subject: Re: Need encoding conversion routines Date: Fri, 14 Mar 2003 12:30:44 +0100 askq1 askq1 schreef: > In particular I need source code (or some way) for following requirements: > - Convert Unicode code-po

Re: Need encoding conversion routines

2003-03-14 Thread Pim Blokland
askq1 askq1 schreef: > In particular I need source code (or some way) for following requirements: > - Convert Unicode code-point to UTF8 encoding and vice-versa. > - Convert Unicode code-point to UCS2 encoding and vice-versa. > - Convert Unicode code-point to UTF16 encoding and vice-versa. Ahem.

Re: Need encoding conversion routines

2003-03-14 Thread jameskass
. Pim Blokland wrote in response to "K.", > > another. (UTF8 to UTF16 etc.) But how can I convert from a > > unicode code-point to some encoding and decode from some > > encoding to unicode code-point? In brief, I want encoding > > and decoding functions for Unicode. > > What is "some encoding"?

Re: Tolkien wanta-be has created entirely new language

2003-03-14 Thread jameskass
. C.T.M. Jacobs wrote, > And yet Tengwar seems to redefine the ASCII range ... > > http://www.gis.net/~dansmith/fonts/tengwar.htm > Not Tengwar itself, of course, but rather the various font developers who have made these "custom-encoded" fonts. The less charitable among us call such fonts "h

Re: Need encoding conversion routines

2003-03-14 Thread askq1 askq1
From: "Pim Blokland" <[EMAIL PROTECTED]> What is "some encoding"? You mean codepage-based 8-bit character sets? By encoding I mean UTF8, UTF16, UTF32, UCS2 etc. Actually my requirement is striaght-forward/common and I believe it should be available somewhere on net. In particular I need source co

Progressing genealogical symbols

2003-03-14 Thread Michael Everson
At 10:01 +0100 2003-03-14, Otto Stolz wrote: Dominikus Scherkl had written: Has anybody meanwhile contributed a proposal regarding the missing genealogical symbols? (after Otto Stolz's message from 24.02.2003 I wondered this was not proposed long ago - or was it and was rejected for some reason?

Re: Tolkien wanta-be has created entirely new language

2003-03-14 Thread c.t.m. jacobs
- Original Message - From: "Doug Ewell" <[EMAIL PROTECTED]> To: "Unicode Mailing List" <[EMAIL PROTECTED]> Cc: "Ruddy, James" <[EMAIL PROTECTED]>; "John Hudson" <[EMAIL PROTECTED]> Sent: Thursday, March 13, 2003 11:09 PM Subject: Re: Tolkien wanta-be has created entirely new language [ .

Re: Need encoding conversion routines

2003-03-14 Thread Pim Blokland
askq1 askq1 schreef: > another. (UTF8 to UTF16 etc.) But how can I convert from a > unicode code-point to some encoding and decode from some > encoding to unicode code-point? In brief, I want encoding > and decoding functions for Unicode. What is "some encoding"? You mean codepage-based 8-bit cha

Unicode 4.0 chapter headings and numbering.

2003-03-14 Thread William Overington
I wonder if you could please say whether the Unicode 4.0 book will have the same chapter headings and numbering as the Unicode 3.0 book? My reason for asking is that I am writing a paper about the possible problems with using languages of the Indian subcontinent on the DVB-MHP (Digital Video Broad

Re: Ligatures fj etc (from Re: Ligatures (qj) )

2003-03-14 Thread William Overington
Yesterday, 13 March 2003, I wrote as follows. quote So I reasoned that the system might scan through a font when it is loaded and decide upon the lowest point for the whole font and then proceed on that basis. end quote An email correspondent has kindly written to me privately and I now know th

Re: New document.

2003-03-14 Thread Otto Stolz
Dominikus Scherkl had written: > Has anybody meanwhile contributed a proposal > regarding the missing genealogical symbols ? > (after Otto Stolz's message from 24.02.2003 I wondered this > was not proposed long ago - or was it and was rejected for > some reason?!?). So do I -- however, I am not in

Re: sorting order between win98/xp

2003-03-14 Thread Michael \(michka\) Kaplan
I have tried it on Win98, WinME, Win2000, WinXP, and Windows Server 2003. LCMapStringA/CompareStringA on all five and LCMapStringW/CompareStringW on the NT-ish platforms. I do not and cannot repro the reported problem of your colleague. Please have that colleague send me some email with their rep