> It's already multibyte safe since 8.4 No, it doesn't.
$ psql test Pager usage is off. psql (8.4.4) Type "help" for help. test=# select similarity('abc', 'abd'); -- OK similarity ------------ 0.333333 (1 row) test=# select similarity('日本語', '日本後'); -- NG similarity ------------ NaN (1 row) test=# select show_trgm('abc'); -- OK show_trgm ------------------------- {" a"," ab",abc,"bc "} (1 row) test=# select show_trgm('日本語'); -- NG show_trgm ----------- {} (1 row) Encoding is EUC_JP, locale is C. Included is the script to reproduce the problem. -- Tatsuo Ishii SRA OSS, Inc. Japan English: http://www.sraoss.co.jp/index_en.php Japanese: http://www.sraoss.co.jp
pg_trgm.sql
Description: Binary data
-- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers