On Fri, 30 Sep 2005 04:25, John Talbot wrote: > I just added 'use utf8;' in my script, and it worked! > > That's strange again, I thought these libraries would just flush > whatever's on their input to the Jabber server. What's the difference I > mean if Perl thinks a text is 5 characters or 10, as long as it flushes > everything?
Darn, this trickled in right after I wrote the other reply. :-)
The answer to this is...
If Perl thought the original file was, say, ISO-8859-1, the accented
characters would have been read as those accented characters. The XML stream
being sent to the Jabber server has to be in UTF-8, so it then proceeds to
encode those characters as two bytes each.
So I guess what you were seeing come out the other end was the result of that
encoding it was adding.
TX
--
Email: Trejkaz Xaoza <[EMAIL PROTECTED]>
Web site: http://trypticon.org/
Jabber ID: [EMAIL PROTECTED]
GPG Fingerprint: 9EEB 97D7 8F7B 7977 F39F A62C B8C7 BC8B 037E EA73
pgpQcEa4NGh79.pgp
Description: PGP signature
