Doh! It looks like its time to eat humble pie. It turns out that the guy here who has 7.3.4 and helped me to reproduce the problem did not follow our own installation instructions (that he recently re-worded!) as follows:
"createdb -E UNICODE -U DB_USER -P DB_PASSWORD DB_NAME" and did not set the encoding. I, like a good boy, did on my 7.2 installation. The guys I am trying to debug the problem for are in another location and are using 7.3.4 too. Hence I narrowed it down to a version problem. I am asking them to check the encoding on their database too and will post back with huge apologies and thanks for your time when they inevitably confirm that the encoding is SQL_ANSI. Thanks, Matty. ----- Original Message ----- From: "Matthew Cooper" <[EMAIL PROTECTED]> To: "Tom Lane" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]> Sent: Monday, September 15, 2003 9:50 AM Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar field > Attached is the UTF-8 encoded sql file in case it got messed up in the mail > transfer. > > And here it is pasted in directly from the window that was displaying > chinese characters. > > insert into mgc values ('分钟练习分钟练习练习'); > > > Looking at the UTF-8 documentation, 10 chinese characters could be any > number of bytes, each character being say 2 or 3 characters. > > Matty. > ----- Original Message ----- > From: "Tom Lane" <[EMAIL PROTECTED]> > To: "Matthew Cooper" <[EMAIL PROTECTED]> > Cc: <[EMAIL PROTECTED]> > Sent: Saturday, September 13, 2003 5:51 PM > Subject: Re: [BUGS] 7.3.2 incorrectly counts characters for unicode varchar > field > > > > > insert into mgc values ('Ã¥Ë?â? éâ?TŸç»Æ'ä¹ Ã¥Ë?â? > éâ?TŸç»Æ'ä¹ ç»Æ'ä¹ '); > > > > I don't think this string is correctly unicode-encoded. Anyway "length" > > claims it is 30 characters. > > > > regards, tom lane > > > ---------------------------(end of broadcast)--------------------------- TIP 5: Have you checked our extensive FAQ? http://www.postgresql.org/docs/faqs/FAQ.html