Roland Mainz wrote at 08/28/06 02:27: >>>How do I specify Unicode outside the Basic Multilingual Plane (BMP, >>>which uses values larger than 2^16) in ksh93? >> >>will handle up to 2^32 bits > > > What about the idea with "\w" to have a way to specify a widechar value > (CC:'ing Ienup and i18n-discuss at opensolaris.org for feedback. The idea > was to add "\wXXXX" to allow someone to specify a locale-specific > widechar value in a similar way how "\uXXXX" can be used to specific a > unicode value in ksh93's "printf" command (see > https://mailman.research.att.com/pipermail/ast-users/2006q3/001220.html) > - which may be usefull for locales like *.GB18030, ja_JP.PCK etc.) ?
In Solaris Unicode/UTF-8 locales, wide characters will be in UTF-32 and that's guaranteed. However, in any other non-UTF-8 locales, wide characters in wchar_t are all treated as opaque and actually not in UTF-32 but something else and they are also different depending on what is the current locale. For the Unicode characters, however, as like C/C++, using \u and \U appears a good idea. Ienup