I am parsing a whole of imperfectly formed HTML.  In doing so, about
1600 errors are generated and dumped to the log/console every time I
parse this feed.  Yet it parses fine with the recover option.  But the
options for supressing the messages don't appear to work.

Here's my code:

parser = LibXML::XML::HTMLParser.string(Downloader.get
(url).clean_html,
  :options => LibXML::XML::HTMLParser::Options::RECOVER |
    LibXML::XML::HTMLParser::Options::NOERROR |
    LibXML::XML::HTMLParser::Options::NOWARNING)
page = parser.parse.root


Is this the right way to declare the options for supressing errors and
warnings?  Why aren't they actually supressed.  I would worry too much
about this normally, but it appears after some profiling it appears
printing and handling these errors takes about 20-40% of my total
request time in my rails app, so its a fairly large performance issue.
_______________________________________________
libxml-devel mailing list
libxml-devel@rubyforge.org
http://rubyforge.org/mailman/listinfo/libxml-devel

Reply via email to