Hello, Am 24.11.2015 um 21:57 schrieb D'Arcy J.M. Cain: > On Tue, 24 Nov 2015 21:02:30 +0100 > Christoph Zwerschke <[email protected]> wrote: ... >> that also takes the locale into account. So checking >= '0' && <= '9' >> is faster, but moreover we want to be as restrictive as possible and >> not have other characters considered digits because of whatever >> strange interpretation of the locale. For instance, '\xb2' would be >> considered a digit on Windows because it is a superscript 2 in cp1252. > > What about a unicode character where the second or third octet falls > into the '0' to '9' range? It seems to me that we really need unicode > versions of ctype functions.
Will not happen: All subsequent bytes of UNICODE characters encoded in UTF-8 will have the MSB set. See `man 7 utf8` Philipp _______________________________________________ PyGreSQL mailing list [email protected] https://mail.vex.net/mailman/listinfo.cgi/pygresql
