Hello,

Am 24.11.2015 um 21:57 schrieb D'Arcy J.M. Cain:
> On Tue, 24 Nov 2015 21:02:30 +0100
> Christoph Zwerschke <[email protected]> wrote:
...
>> that also takes the locale into account. So checking >= '0' && <= '9'
>> is faster, but moreover we want to be as restrictive as possible and
>> not have other characters considered digits because of whatever
>> strange interpretation of the locale. For instance, '\xb2' would be
>> considered a digit on Windows because it is a superscript 2 in cp1252.
> 
> What about a unicode character where the second or third octet falls
> into the '0' to '9' range?  It seems to me that we really need unicode
> versions of ctype functions.

Will not happen: All subsequent bytes of UNICODE characters encoded in
UTF-8 will have the MSB set. See `man 7 utf8`

Philipp
_______________________________________________
PyGreSQL mailing list
[email protected]
https://mail.vex.net/mailman/listinfo.cgi/pygresql

Reply via email to