"Gil Vidals" <[EMAIL PROTECTED]> writes:

> I have been using HTML::Parser for a few years now and would like to resolve
> the issue of processing malformed HTML -- that is, HTML with missing start
> and end tags. In particular, I'm running into many web pages which are
> missing the closing </A> and </SCRIPT> tags.
>  
> Is there an easy way for HTML::Parser to insert implied tags, such as what
> is now done by HTML::TreeBuilder and HTML::Element?

No, but if you can come up with simple rules for when the missing tags
should be inserted then writing a wrapper should be easy enough :)
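
A minimal sketch of such a wrapper, assuming one simple rule: <A>
elements never nest, so a new <A> start tag (or the end of the document)
implies a missing </A>. The helper name close_anchors is hypothetical;
only the HTML::Parser constructor, handler options, parse and eof are
taken from the module itself.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper: returns the input HTML with implied </a> tags added.
sub close_anchors {
    my ($html) = @_;
    my $out  = '';
    my $open = 0;    # true while an <a> element is still open

    my $p = HTML::Parser->new(
        api_version => 3,
        start_h => [ sub {
            my ($tag, $text) = @_;
            if ($tag eq 'a') {
                # Assumed rule: anchors never nest, so close the old one.
                $out .= '</a>' if $open;
                $open = 1;
            }
            $out .= $text;
        }, 'tagname, text' ],
        end_h => [ sub {
            my ($tag, $text) = @_;
            $open = 0 if $tag eq 'a';
            $out .= $text;
        }, 'tagname, text' ],
        # Text, comments, declarations etc. are copied through unchanged.
        default_h => [ sub { $out .= shift }, 'text' ],
    );

    $p->parse($html);
    $p->eof;
    $out .= '</a>' if $open;    # anchor still open at end of document
    return $out;
}

print close_anchors('<a href="x">one <a href="y">two'), "\n";
```

Other rules (for instance, closing an open <A> when a block-level tag
starts) would go in the same start handler.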

Missing </SCRIPT> can be a challenge because HTML::Parser will just
report the rest of the document as text.  You would have to find a
suitable place to restart parsing within this text and then perhaps
start off a new HTML::Parser instance there.
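
A rough sketch of that restart idea, under a loudly assumed heuristic:
if the document ends while still inside <SCRIPT>, HTML is taken to
resume at the first thing in the script text that looks like a common
structural tag. The $RESUME pattern and the parse_with_restart name are
hypothetical, and the pattern will misfire on scripts that contain such
tags inside string literals.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical heuristic for where HTML resumes inside runaway script text.
my $RESUME = qr{</?(?:body|html|div|p|table|h[1-6])\b}i;

# Appends [type, text] pairs to @$events, restarting after an unclosed script.
sub parse_with_restart {
    my ($html, $events) = @_;
    my $in_script   = 0;
    my $script_text = '';

    my $p = HTML::Parser->new(
        api_version => 3,
        start_h => [ sub {
            my ($tag, $text) = @_;
            $in_script = 1 if $tag eq 'script';
            push @$events, [ start => $text ];
        }, 'tagname, text' ],
        end_h => [ sub {
            my ($tag, $text) = @_;
            if ($tag eq 'script') {
                push @$events, [ text => $script_text ] if length $script_text;
                ($in_script, $script_text) = (0, '');
            }
            push @$events, [ end => $text ];
        }, 'tagname, text' ],
        text_h => [ sub {
            my ($text) = @_;
            if ($in_script) { $script_text .= $text }   # hold until we know
            else            { push @$events, [ text => $text ] }
        }, 'text' ],
    );
    $p->parse($html);
    $p->eof;
    return unless $in_script;    # every <script> was closed properly

    # Unclosed script: split the held text at the assumed resume point.
    my $pos = $script_text =~ $RESUME ? $-[0] : length $script_text;
    my $js  = substr($script_text, 0, $pos);
    push @$events, [ text => $js ] if length $js;
    push @$events, [ end => '</script>' ];    # the implied end tag
    my $rest = substr($script_text, $pos);
    parse_with_restart($rest, $events) if length $rest;  # new parser instance
}

my @events;
parse_with_restart('<script>var x = 1;<p>Hello</p>', \@events);
print join('', map { $_->[1] } @events), "\n";
```

Each restart parses a strictly shorter string, so the recursion ends
even when several scripts are left unclosed.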

I would just use HTML::TreeBuilder :)

Regards,
Gisle
