Hi all,

I have the following script (just a test):

---
#!/usr/bin/perl -w

use strict;
use HTML::Entities;

my $string = 'Alfred Döblin: Berlin Alexanderplatz';
my $string2 = 'Alfred Döblin: Berlin Alexanderplatz';

$string  = decode_entities($string);

print $string, "\n", $string2, "\n";
---

This prints

Alfred D?blin: Berlin Alexanderplatz
Alfred Döblin: Berlin Alexanderplatz

in my terminal

Now the perldoc for HTML::Entities says

>decode_entities( $string )
>This routine replaces HTML entities found in the $string with the
>corresponding ISO-8859-1 character, and if possible (under perl 5.8
>or later) will replace to Unicode characters.  Unrecognized enti-
>ties are left alone.

I do have Perl 5.8.1, so I'd expect the decode_entities method to return a Unicode 
character string. Why doesn't it do that?

Thanks,

Jan
-- 
A common mistake that people make when trying to design something completely foolproof 
is to underestimate the ingenuity of complete fools.

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>


Reply via email to