Re: [Python-Dev] Reject characters bigger than U+10FFFF and Solaris issues

2011-12-08 Thread Stefan Krah
Victor Stinner wrote: > The problem is not directly in the C localeconv() function, but in > mbstowcs() with the hu_HU locale. Ah, I see. > You can try my test program for this issue: > http://bugs.python.org/file23876/localeconv_wchar.c Can't test on OpenSolaris, since Oracle removed the pac

Re: [Python-Dev] Reject characters bigger than U+10FFFF and Solaris issues

2011-12-08 Thread Victor Stinner
On 08/12/2011 10:17, Stefan Krah wrote: > I think that b'\xA0' is a valid thousands separator. I agree, but that is not the point: the problem is that mbstowcs() decodes b'\xA0' to a strange U+3020 character. Currently I have this horrible function to deal with the problem: ...
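
For reference, a minimal sketch of what a correct decoder would be expected to produce for this byte. It is not Victor's workaround function, which is elided above. It assumes the hu_HU locale uses ISO-8859-2; the thread only says the encoding looks like ISO-8859-??.

```python
import unicodedata

raw = b"\xa0"  # thousands separator byte reported in the thread

# Assumption: hu_HU uses ISO-8859-2 (the thread only says "ISO-8859-??").
decoded = raw.decode("iso-8859-2")
print("U+%04X %s" % (ord(decoded), unicodedata.name(decoded)))

# The same byte on its own is not valid UTF-8, so a UTF-8 decoder must reject it.
try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False
print("decodable as UTF-8:", utf8_ok)
```

Under that assumption the byte maps to U+00A0 (NO-BREAK SPACE), which is why a result of U+3020 from mbstowcs() points at the decoder rather than at the separator itself.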

Re: [Python-Dev] Reject characters bigger than U+10FFFF and Solaris issues

2011-12-08 Thread Stefan Krah
Stefan Krah wrote: > I'm not sure why the b'\xA0' problem only occurs in Solaris. Many systems > have this thousands separator. Are LC_CTYPE and LC_NUMERIC set to the same value on the buildbot? Otherwise you encounter http://bugs.python.org/issue7442 . Stefan Krah
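
A quick way to answer that question on a given machine might look like the sketch below. The helper name locale_categories() is hypothetical; locale.setlocale() called without a new value only queries the current setting. The check matters because localeconv() strings follow LC_NUMERIC, while mbstowcs() decodes according to LC_CTYPE.

```python
import locale

def locale_categories():
    """Return (LC_CTYPE, LC_NUMERIC, same) for the current process."""
    ctype = locale.setlocale(locale.LC_CTYPE)      # query only, nothing is changed
    numeric = locale.setlocale(locale.LC_NUMERIC)  # query only, nothing is changed
    return ctype, numeric, ctype == numeric

ctype, numeric, same = locale_categories()
print("LC_CTYPE   =", ctype)
print("LC_NUMERIC =", numeric)
if not same:
    # With differing categories, the bytes returned by localeconv() may be
    # decoded with an encoding they were not written in.
    print("categories differ")
```

Running this in the buildbot's environment, after whatever setlocale() calls the test makes, would show whether the two categories match there.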

Re: [Python-Dev] Reject characters bigger than U+10FFFF and Solaris issues

2011-12-08 Thread Stefan Krah
Victor Stinner wrote: > For localeconv(), it is the b'\xA0' byte string decoded from an encoding > looking like ISO-8859-?? (b'\xA0' is not decodable from UTF-8). It looks like > a bug in the decoder. It also looks like OpenIndiana doesn't use ISO-8859 > locale anymore, only UTF-8 locales (whic