> > GHC 6.2 (shortly to be released) also supports toUpper, toLower, and > the > > character predicates isUpper, isLower etc. on the full Unicode > character > > set. > > > > There is one caveat: the implementation is based on the C library's > > towupper() and so on, so the support is only as good as the > C library > > provides, and it relies on wchar_t being equivalent to Unicode (the > > sensible choice, but not all libcs do this). > > Now, why would one want to base this on C's wchar_t and its > "w" routines?
Because it's easy, and it's still an improvement over what we had before. > wchar_t is sometimes (isolated) UTF-32 code units, > including in Linux, sometimes it is (isolated) UTF-16 code units, > including in Windows, and sometimes something utterly useless. Yes, I mentioned this above. > Please instead use ICU's UChar32, Thanks, I didn't know about ICU - it looks nice. I did briefly investigate libunicode, which turned out to be worse than the C library. Cheers, Simon _______________________________________________ Haskell mailing list [EMAIL PROTECTED] http://www.haskell.org/mailman/listinfo/haskell