RE: Encoding conversions

Carl W. Brown Mon, 10 Sep 2001 16:39:13 -0700
Juliusz,

> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED]]On Behalf Of Juliusz Chroboczek
> Sent: Monday, September 10, 2001 2:12 PM
> To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
> Subject: Re: Encoding conversions
>
>
> > wchar_t is a very wrong thing to normalize to, because it is OS and
> > locale dependent.

I know that it is OS dependent but how is it locale dependent?

>
>   #ifdef __STDC_ISO_10646__
>     typedef whar_t uchar_t;
>   #else
>     typedef unsigned long uchar_t;
>   #endif
>

I think you meant:

     typedef wchar_t uchar_t;

There are another rationales for defining you own.  First it make it easier
to port to another platform.  I have defined UChar32, UChar16, and UChar8.
Second if you use a re-definition like UChar8 for all UTF-8 code it will
make sure that you use unsigned char consistently and not just char.  This
is something that will also bite you especially with different compilers.

Carl



-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
RE: Encoding conversions

Reply via email to