Arabic Presentation Forms

2003-01-30 Thread Mete Kural
Hello Unicode-List, I need to figure out a method to convert Arabic Unicode text encoded in its normal form to Arabic Unicode text encoded in Arabic presentation forms. So the Unicode of each character having an Arabic encoding (ex: 0622) would be converted to the equivalent Arabic presentation fo

Re: vietnamese font

2003-01-30 Thread Paul Hastings
> correct. > Plain old Arial actually isn't your best choice, because it displays the actually i meant ms arial unicode. its handy but huge. > Fonts on my Windows 2000 system (at work) that support the Vietnamese > precomposed characters *and* display these combinations correctly > include: g

Re: vietnamese font

2003-01-30 Thread Doug Ewell
Paul Hastings wrote: > besides ms arial would anybody like to recommend a font suitable for > vietnamese? Plain old Arial actually isn't your best choice, because it displays the circumflex-plus-grave and circumflex-plus-acute combinations incorrectly. For Vietnamese they're supposed to be si

Re: urban legends just won't go away!

2003-01-30 Thread Kenneth Whistler
> This is a simple example demonstrating my own personal method. > //to upper case public char upper(int c) { return (char)((c >= 97 && c <=122) ? VisitSewers(c) : c); } static int VisitSewers(int c) { return AlligatorByte(c); } static int AlligatorByte(int c) { // R

Fwd: News: W3C home page genuinely served as UTF-8

2003-01-30 Thread Asmus Freytag
This came in recently: From: Martin Duerst <[EMAIL PROTECTED]> Subject: News: W3C home page genuinely served as UTF-8 This is just a very small news item that I wanted to share: (probably too little too late, but a step in the right direction anyway) Since a few minutes, the W3C home page at ht

Re: Indic Devanagari Query

2003-01-30 Thread Anto'nio Martins-Tuva'lkin
On 2003.01.29, 05:52, Aditya Gokhale <[EMAIL PROTECTED]> wrote: > 1. In Marathi and Sanskrit language two characters glyphs of 'la' and > 'sha' are represented differently as shown in the image below - > > (First glyph is 'la' and second one is 'sha') > > as compared to Hindi where these character

Re[2]: ISO 639 arg - Esperanto

2003-01-30 Thread Anto'nio Martins-Tuva'lkin
On 2003.01.22, 17:04, Markus Scherer <[EMAIL PROTECTED]> wrote: > If only Ferran used "real" characters (car) instead of > transliterations (cxar) - oddly intermixed with using > "real" é in "aragonés" :-} > > Sorry for being off-topic. Not too off-topic, IMHO, as it concerns the still uncomplete

vietnamese font

2003-01-30 Thread Paul Hastings
besides ms arial would anybody like to recommend a font suitable for vietnamese? thanks. -- Paul Hastings [EMAIL PROTECTED] Member Team Macromedia (ColdFusion)

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread jameskass
. Kent Karlsson wrote, > > I add that this is a good way of displaying a combining mark that has no > > base character, i.e. one occurring at the begin of a line or paragraph. > > No, those should be displayed *as if* preceded by a SPACE (TUS 3.0 page 121). So it says. But, the 'space method' c

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread John Hudson
At 01:20 AM 1/30/2003, Marco Cimarosti wrote: However, I totally agree with Kent that this funny rendering is *not* a requirement of the Unicode standard, as Keyur Shroff seems to suggest. It is just an example of many "several methods [that] are available to deal with" strange sequences. Perha

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Marco Cimarosti
Keyur Shroff wrote: > > However, I totally agree with Kent that this funny > rendering is *not* a > > requirement of the Unicode standard, as Keyur Shroff seems > to suggest. It > > is just an example of many "several methods [that] are > available to deal > > with" strange sequences. > > A seq

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Kent Karlsson
> Let me give a proper example this time. Consider a "Vowel Sign E" [U+0947] > appearing after any non-consonant character. This sign is generally > attached to the consonants. It has zero advance width with negative left > side bearing in the font. Ok. > Clearly, since in this case the sign is

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Kent Karlsson
> > I don't know where you find support for that position in that text. > > Can you please quote? There are no "invalid base consonants" for > > any dependent vowel (for Indic scripts; similarly for any > > other script). > > Actually, there is a mention of displaying combining marks on dotted

RE: urban legends just won't go away!

2003-01-30 Thread Carl W. Brown
Barry, If you think that this is bad try 390 mainframe EBCDIC shift to upper case. You can shift up to 256 characters at a time with a single machine language instruction by ORing a line of spaces to your character field. Now that is bit flipping and is still heavily used. Carl > -Original

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Keyur Shroff
--- Marco Cimarosti <[EMAIL PROTECTED]> wrote: > > I add that this is a good way of displaying a combining mark that has no > base character, i.e. one occurring at the begin of a line or paragraph. > > However, I totally agree with Kent that this funny rendering is *not* a > requirement of the

RE: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Marco Cimarosti
Kent Karlsson wrote: > Keyur Shroff wrote > [...] > > In Indic scripts any sign that appear in text not in > > conjunction with a > > valid consonant base may be rendered with dotted circle as fallback > > mechanism (Section 5.14 "Rendering Nonspacing Marks" > > http://www.unicode.org/uni2book/ch0

Re: Suggestions in Unicode Indic FAQ

2003-01-30 Thread Aditya Gokhale
  To support what Kayur has to say I will add few more things.   Take for instance a "vowel sigh" (matras as we call here in India) e.g. say is e (U093F), is combined with a consonant like ka (U0915) in the sequence it forms ke. (Please see the first image). The repositioning of the shape ha