"Gil Vidals" <[EMAIL PROTECTED]> writes: > I have been using HTML::Parser for a few years now and would like to resolve > the issue of processing malformed HTML -- that is missing start and end > tags. In particular, I'm running many web pages which are missing the > closing </A> and </SCRIPT> tags. > > Is there is an easy way for HTML::Parser to insert implied tags such as what > is now done by HTML::TreeBuilder and HTML::Element????

No, but if you can come up with simple rules for when the missing tags
should be inserted, then writing a wrapper should be easy enough :)

Missing </SCRIPT> can be a challenge because HTML::Parser will just report
the rest of the document as text. You would have to find a suitable place
to restart parsing within this text and then perhaps start off a new
HTML::Parser instance there.

I would just use HTML::TreeBuilder :)

Regards,
Gisle
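
To illustrate the wrapper idea for the missing </A> case, here is a minimal sketch. The rule it applies is only an assumption made for the example, not something HTML::Parser provides: an open <a> is treated as implicitly closed when another <a> starts or when the document ends. The function name close_missing_anchors is hypothetical. It does not attempt the harder </SCRIPT> case described above.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper: re-emit the document unchanged, but insert '</a>'
# wherever the assumed rule says an anchor was left open.
sub close_missing_anchors {
    my ($html) = @_;
    my $out    = '';
    my $open_a = 0;    # true while we are inside an unclosed <a>

    my $parser = HTML::Parser->new(
        api_version => 3,
        start_h => [
            sub {
                my ($tagname, $text) = @_;
                if ($tagname eq 'a') {
                    # Assumed rule: a new <a> ends any anchor still open.
                    $out .= '</a>' if $open_a;
                    $open_a = 1;
                }
                $out .= $text;
            },
            'tagname, text',
        ],
        end_h => [
            sub {
                my ($tagname, $text) = @_;
                $open_a = 0 if $tagname eq 'a';
                $out .= $text;
            },
            'tagname, text',
        ],
        # Everything else (text, comments, declarations) passes through as-is.
        default_h => [
            sub {
                my ($text) = @_;
                $out .= $text if defined $text;
            },
            'text',
        ],
    );

    $parser->parse($html);
    $parser->eof;

    # Assumed rule: an anchor still open at end of document is closed there.
    $out .= '</a>' if $open_a;
    return $out;
}

print close_missing_anchors('<a href="one">One <a href="two">Two'), "\n";
```

Closing at the next <a> is a reasonable starting rule because HTML does not allow nested anchors. Real pages will probably need more rules, for example closing the anchor at the end of the enclosing block element, and at that point HTML::TreeBuilder is likely the simpler choice.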