Hello,

Rick Measham wrote:
> My thought process is to convert them into utf8 and store that in the
> database. Then it's just a case of retrieving them later and outputting
> them all on one page marked as utf8.
> 
> That being the case, I grab the charset and use Encode's decode function
> to turn it into 'perl's internal format' .. which in 5.8.5 is utf8
> right? I then store that in the db.
> 
> However it's not working.
> 
> Does that mean that the encoding of the actual characters on the page is
> not in the charset in the meta tag? Or am I missing some piece of the
> puzzle?

Please check this sample. Did you handle Perl's internal format in such
like manner?

#!/usr/local/bin/perl -w
use 5.008;
use strict;
use warnings;

use LWP::Simple;
my $uri = 'http://www.reitsport-schill.de/index1053542873.html';
my $remotedoc = get($uri);
$remotedoc =~ m/<title>(.*?)<\/title>/;
my $ISO_8859_1 = $1;

use Encode;
my $Perl_Internal = decode('ISO-8859-1', $ISO_8859_1);
my $UTF8 = encode('UTF8', $Perl_Internal);
print $UTF8;
__END__

-- 
Masanori HATA
<[EMAIL PROTECTED]>
He's always with us!

Reply via email to