Andries Brouwer said:
> Turkish has i with dot and i without dot,
> and unsurprisingly the upper case of dotted i is dotted I,
> the lower case of dotless I is dotless i.
> Now dotted i and dotless I are in the ASCII range (single UTF-8 byte),
> while dotless i is U+0131, dotted I is U+0130. Both take two bytes.
>
> These are common vowels.

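For concreteness, here is a minimal sketch in C of the byte counts involved. The names utf8_len and check_turkish_i are hypothetical helpers, and the case pairs are hard-coded on the assumption that a Turkish locale maps them as described above; the code does not call towlower() itself, so it says nothing about what any particular libc returns.

```c
#include <assert.h>
#include <stddef.h>

/* Number of bytes UTF-8 needs for code point cp; 0 if cp cannot be encoded. */
static size_t utf8_len(unsigned long cp)
{
    if (cp < 0x80)
        return 1;                       /* ASCII range */
    if (cp < 0x800)
        return 2;                       /* includes U+0130 and U+0131 */
    if (cp >= 0xD800 && cp <= 0xDFFF)
        return 0;                       /* surrogates are not valid scalar values */
    if (cp < 0x10000)
        return 3;
    if (cp <= 0x10FFFF)
        return 4;
    return 0;                           /* beyond the Unicode range */
}

/* Hypothetical self-check: the Turkish case pairs change encoded length. */
static void check_turkish_i(void)
{
    /* dotted I (U+0130) -> i (U+0069): two bytes become one */
    assert(utf8_len(0x0130) == 2 && utf8_len(0x0069) == 1);
    /* I (U+0049) -> dotless i (U+0131): one byte becomes two */
    assert(utf8_len(0x0049) == 1 && utf8_len(0x0131) == 2);
}
```

The point of the sketch is only that a buffer sized for the input string may be too small, or end up with slack, once the case of these letters is changed.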
So you're saying that if I call towlower(0x0130) (dotted I) in a Turkish locale, I'll get 0x0069 (ASCII i)? Well, at least I know what not to do, and if I do do it, I know where it will fail.

Thanks Andries,
Mike
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
