Hey guys,
The 'Unicode characters above 0x10000' issue keeps rearing its ugly head in the IRC channel. I propose that it be fixed, even backported...
This is John Hansen's most recent patch to fix it:
http://archives.postgresql.org/pgsql-patches/2004-11/msg00259.php
And from what I can tell it was committed, then reverted because it wasn't a "bug". It was going to go in for 8.1.
We on the channel are starting to think that it is in fact a bug. There are are people with legitimately utf-8 encoded XML documents that they cannot store in PostgreSQL. Apparently in the distant past, Unicode was limited to 0x10000, but then was extended.
Perhaps we can reopen this case...
Chris
---------------------------(end of broadcast)--------------------------- TIP 7: don't forget to increase your free space map settings