Tijl Houtbeckers wrote:
On Thu, 29 Sep 2005 17:57:04 +0200, John Talbot <[EMAIL PROTECTED]> wrote:


That is very surprising. Since Perl probably has nothing to do with the
unicode here, the culprit has to be jabberd then. I'll try to upgrade
(though I use the apt-get system for which the most recent versions
don't always exist).

Back up a second there ;) Perl is notorious for being bad with unicode, wether that's a repuation still deserved I don't know, but it was justified in the past. And afaik never in the history of jabberd has there been such a serious problem with UTF-8 handeling.

You are absolutely right. I tried using another public jabber server, before installing one myself, and same malfunction happened.
I even tried Psi, and that didn't make it work either.

I think you should consider first:
- is the file UTF-8? (you seem to have this covered)

Yes.

- is your version of Perl configured right to read unicode UTF-8files..

I'm not sure about this... Perl can get configured? During compile time you mean? Also I didn't think that Perl could have anything to do with this, because the libraries (Net::XMPP::etc and Net::Jabber::etc) don't contain the string 'utf8' anywhere, so I was guessing that these libraries were just passing whatever data they found inside the <body> tags without regard for utf8 compliance... but I guess I was wrong?

- *and* to use unicode for string handeling by default

No, it doesn't have that (and 5.8.6 has that?). It has got to be Perl's fault, but perhaps there's a way to avoid installing a second version of Perl on my system (I've got to keep the old one too, so many .debs are dependent on it in some way) - is this advice you're giving accurate? i.e. are newer versions of Perl handling unicode by default? If so, maybe I can just plug a 'use utf8;' command or type-in some CLI switch and make it all happen...

And what do you mean by Perl being able to read unicode files right? Aren't utf8 files just a series of bytes in Perl's eyes, just like any other file?

- do you have the most recent Net::Jabber

Yes, and there's only two versions of Net::Jabber (0.1 and 1.0) so all have the same.

Not critizing you or anything, but I'm worried you'll loose a lot of time accomplishing very little trying to find a UTF-8 bug in jabberd. Jabberd's unicode handeling is independant of how your system is setup, how you build it, etc. The same can not be said for Perl. If you're still not convinced it's not jabberd, try another server (a public one).

Yes, and thanks, you did save me. Installing jabberd 1.4.4 was the next thing I was going to do... but it's Perl's fault actually :-)

Reply via email to