Re: mbstoupper or utf8toupper

Michael B Allen Wed, 05 Jan 2005 19:58:49 -0800

Andries Brouwer said:
> Turkish has i with dot and i without dot,
> and unsurprisingly the upper case of dotted i is dotted I,
> the lower case of dotless I is dotless i.
> Now dotted i and dotless I are in the ASCII range (single UTF-8 byte),
> while dotless i is U+0131, dotted I is U+0130. Both take two bytes.
>
> These are common vowels.


So you're saying that if I call towlower(0x0130) (dotted I) in a Turkish
locale, I'll get 0x0069 (ASCII i)?

Well, at least I know what not to do, and if I do it anyway I know where it
will fail.
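
To make the failure concrete, here is a minimal sketch of why an in-place
UTF-8 case conversion breaks on this pair: U+0130 takes two bytes and U+0069
takes one. The helper names utf8_len and lower_changes_utf8_len are made up
for illustration. The sketch also assumes wchar_t values are Unicode code
points, which holds on glibc but is not guaranteed by the C standard.

```c
#include <assert.h>
#include <stddef.h>
#include <wchar.h>
#include <wctype.h>

/* Number of bytes UTF-8 needs for one code point (0 if out of range). */
static size_t utf8_len(unsigned long cp)
{
    if (cp < 0x80UL)      return 1;
    if (cp < 0x800UL)     return 2;
    if (cp < 0x10000UL)   return 3;
    if (cp <= 0x10FFFFUL) return 4;
    return 0;
}

/*
 * Hypothetical helper: returns 1 if lowercasing wc in the *current*
 * locale yields a character whose UTF-8 encoding has a different
 * length, i.e. the case where converting a buffer in place goes wrong.
 * Assumes wchar_t holds Unicode code points.
 */
static int lower_changes_utf8_len(wint_t wc)
{
    wint_t lower;

    assert(wc != WEOF);
    lower = towlower(wc);
    return utf8_len((unsigned long)wc) != utf8_len((unsigned long)lower);
}
```

To try it, call setlocale(LC_CTYPE, "tr_TR.UTF-8") first (the locale name is
an assumption and must be installed on the system), then check
lower_changes_utf8_len(0x0130). What towlower() returns there depends on the
C library's locale data. The byte counts do not: utf8_len(0x0130) is 2 and
utf8_len(0x0069) is 1, so a converter that writes the result back over the
input cannot assume the string length stays the same.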

Thanks Andries,
Mike

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
