Thibaut Britz <[EMAIL PROTECTED]> writes:

> the following produces a segfault using the latest version of libwww. 

I see segfaults with ActivePerl 810 but not with our latest builds.
What version of perl are you using?  The segfault appears to be a bug
in perl, and I would like to find out whether the problem has really
been fixed.

> As it seems, HTML::Parser is marking non UTF8 strings as UTF8 strings.

Did you enable the Unicode support when you installed HTML-Parser?  It
seems like this would be the only time this happens, but I want to be
sure.

> or to see it:
> 
> #!/usr/bin/perl
> use warnings;
> use strict;
> use Devel::Peek;
> use HTML::Parser;
> my $html = qq{<img title="&rsquo;\260">};
> my $p = HTML::Parser->new(api_version=>3,
>     start_h=>[sub{Dump(shift->{title})}, "attr"]);
> $p->parse($html);

What output do you get?
