Hello Unicode-List,
I need to figure out a method to convert Arabic
Unicode text encoded in its normal form to Arabic
Unicode text encoded in Arabic presentation forms. So
the Unicode of each character having an Arabic
encoding (ex: 0622) would be converted to the
equivalent Arabic presentation fo
>
correct.
> Plain old Arial actually isn't your best choice, because it displays the
actually i meant ms arial unicode. its handy but huge.
> Fonts on my Windows 2000 system (at work) that support the Vietnamese
> precomposed characters *and* display these combinations correctly
> include:
g
Paul Hastings wrote:
> besides ms arial would anybody like to recommend a font suitable for
> vietnamese?
Plain old Arial actually isn't your best choice, because it displays the
circumflex-plus-grave and circumflex-plus-acute combinations
incorrectly. For Vietnamese they're supposed to be si
> This is a simple example demonstrating my own personal method.
>
//to upper case
public char upper(int c)
{
return (char)((c >= 97 && c <=122) ? VisitSewers(c) : c);
}
static int VisitSewers(int c)
{
return AlligatorByte(c);
}
static int AlligatorByte(int c)
{
// R
This came in recently:
From: Martin Duerst <[EMAIL PROTECTED]>
Subject: News: W3C home page genuinely served as UTF-8
This is just a very small news item that I wanted to share:
(probably too little too late, but a step in the right
direction anyway)
Since a few minutes, the W3C home page at ht
On 2003.01.29, 05:52, Aditya Gokhale <[EMAIL PROTECTED]> wrote:
> 1. In Marathi and Sanskrit language two characters glyphs of 'la' and
> 'sha' are represented differently as shown in the image below -
>
> (First glyph is 'la' and second one is 'sha')
>
> as compared to Hindi where these character
On 2003.01.22, 17:04, Markus Scherer <[EMAIL PROTECTED]> wrote:
> If only Ferran used "real" characters (car) instead of
> transliterations (cxar) - oddly intermixed with using
> "real" é in "aragonés" :-}
>
> Sorry for being off-topic.
Not too off-topic, IMHO, as it concerns the still uncomplete
besides ms arial would anybody like to recommend a font suitable for
vietnamese?
thanks.
--
Paul Hastings [EMAIL PROTECTED]
Member Team Macromedia (ColdFusion)
.
Kent Karlsson wrote,
> > I add that this is a good way of displaying a combining mark that has no
> > base character, i.e. one occurring at the begin of a line or paragraph.
>
> No, those should be displayed *as if* preceded by a SPACE (TUS 3.0 page 121).
So it says. But, the 'space method' c
At 01:20 AM 1/30/2003, Marco Cimarosti wrote:
However, I totally agree with Kent that this funny rendering is *not* a
requirement of the Unicode standard, as Keyur Shroff seems to suggest. It is
just an example of many "several methods [that] are available to deal with"
strange sequences.
Perha
Keyur Shroff wrote:
> > However, I totally agree with Kent that this funny
> rendering is *not* a
> > requirement of the Unicode standard, as Keyur Shroff seems
> to suggest. It
> > is just an example of many "several methods [that] are
> available to deal
> > with" strange sequences.
>
> A seq
> Let me give a proper example this time. Consider a "Vowel Sign E" [U+0947]
> appearing after any non-consonant character. This sign is generally
> attached to the consonants. It has zero advance width with negative left
> side bearing in the font.
Ok.
> Clearly, since in this case the sign is
> > I don't know where you find support for that position in that text.
> > Can you please quote? There are no "invalid base consonants" for
> > any dependent vowel (for Indic scripts; similarly for any
> > other script).
>
> Actually, there is a mention of displaying combining marks on dotted
Barry,
If you think that this is bad try 390 mainframe EBCDIC shift to upper case.
You can shift up to 256 characters at a time with a single machine language
instruction by ORing a line of spaces to your character field. Now that is
bit flipping and is still heavily used.
Carl
> -Original
--- Marco Cimarosti <[EMAIL PROTECTED]> wrote:
>
> I add that this is a good way of displaying a combining mark that has no
> base character, i.e. one occurring at the begin of a line or paragraph.
>
> However, I totally agree with Kent that this funny rendering is *not* a
> requirement of the
Kent Karlsson wrote:
> Keyur Shroff wrote
> [...]
> > In Indic scripts any sign that appear in text not in
> > conjunction with a
> > valid consonant base may be rendered with dotted circle as fallback
> > mechanism (Section 5.14 "Rendering Nonspacing Marks"
> > http://www.unicode.org/uni2book/ch0
To support what Kayur has to say I will add few more
things.
Take for instance a "vowel sigh" (matras as we call here in
India) e.g. say is e (U093F), is combined with a consonant like ka (U0915) in
the sequence it forms ke. (Please see the first image). The repositioning of the
shape ha
17 matches
Mail list logo