[GENERAL] Unicode database on non-unicode operating system

2008-07-15 Thread Morten Barklund
Hi, I have a PostgreSQL 8.0.3 running on an older debian server and have some problems with unicode databases and character conversions. First up, some backgrund info about my server and installation: test=# \set VERSION = 'PostgreSQL 8.0.3 on i386-pc-linux-gnu, compiled by GCC cc

Re: [GENERAL] Unicode database on non-unicode operating system

2008-07-15 Thread Peter Eisentraut
Am Dienstag, 15. Juli 2008 schrieb Morten Barklund: My problem is, that the lowercase versions of non-ascii characters are broken. Specifically I found, that when lower() is invoked on a text with non-ascii characters, the operating system's locale is used for converting each octet in the

Re: [GENERAL] Unicode database on non-unicode operating system

2008-07-15 Thread Morten Barklund
-Original Message- From: Peter Eisentraut [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 15, 2008 2:33 PM To: pgsql-general@postgresql.org Cc: Morten Barklund Subject: Re: [GENERAL] Unicode database on non-unicode operating system Am Dienstag, 15. Juli 2008 schrieb Morten Barklund: My

Re: [GENERAL] Unicode database on non-unicode operating system

2008-07-15 Thread Peter Eisentraut
Am Dienstag, 15. Juli 2008 schrieb Morten Barklund: I can see that lc_collate (sorting) and lc_ctype (lower-upper conversion) is set to en_DK and I guess that default encoding for en_DK is iso88591 or maybe windows1252. It is ISO-8859-1. There is no support for Windows charmaps on Linux.

Re: [GENERAL] Unicode database on non-unicode operating system

2008-07-15 Thread Morten Barklund
: Tuesday, July 15, 2008 3:50 PM To: pgsql-general@postgresql.org Cc: Morten Barklund Subject: Re: [GENERAL] Unicode database on non-unicode operating system Am Dienstag, 15. Juli 2008 schrieb Morten Barklund: I can see that lc_collate (sorting) and lc_ctype (lower-upper conversion) is set to en_DK and I