Marco:

I think you are converting twice:

> # output will be utf8
> binmode(STDOUT, ":utf8");
>...
>                 from_to($html_text,$charset,"utf8");
>...

Here, it will convert html_text to utf-8 again because of binmode with
utf-8:

>                 print "CURRENT URL $url\n$html_text\n";

I think you can just remove the binmode line and it will work.

>Why do encodings always cause so much pain?

I hope this helps today's pain, at least :-).

Regards,

=Ed

Reply via email to