Re: [HACKERS] Regexps vs. locale

2009-01-06 Thread Bruce Momjian
Added to TODO: Add ability to use case-insensitive regular expressions on multi-byte characters ILIKE already works with multi-byte characters * http://archives.postgresql.org/pgsql-hackers/2008-12/msg00433.php

Re: [HACKERS] Regexps vs. locale

2008-12-10 Thread Tom Lane
Andrew Gierth [EMAIL PROTECTED] writes: Tom == Tom Lane [EMAIL PROTECTED] writes: Tom It's not so much intentional as that no one has gotten around to Tom making it work. The difficulty is that the wide-char codes we Tom are using might not match what the wctype.h functions expect, Tom

[HACKERS] Regexps vs. locale

2008-12-08 Thread Andrew Gierth
This came up on irc: postgres=# show lc_ctype; lc_ctype - fr_FR.UTF-8 postgres=# show server_encoding; server_encoding - UTF8 (1 row) postgres=# select E'\303\201' ILIKE E'\303\241'; ?column? -- t (1 row) postgres=# select E'\303\201' ~*

Re: [HACKERS] Regexps vs. locale

2008-12-08 Thread Tom Lane
Andrew Gierth [EMAIL PROTECTED] writes: Obviously, this happens because the locale support functions in backend/regex/regc_locale.c are (presumably intentionally) crippled so as not to support non-ascii chars, despite all the code there using wide chars for everything otherwise. It's not so

Re: [HACKERS] Regexps vs. locale

2008-12-08 Thread Andrew Gierth
Tom == Tom Lane [EMAIL PROTECTED] writes: Andrew Gierth [EMAIL PROTECTED] writes: Obviously, this happens because the locale support functions in backend/regex/regc_locale.c are (presumably intentionally) crippled so as not to support non-ascii chars, despite all the code there using