I am parsing a whole lot of imperfectly formed HTML. Every time I
parse this feed, about 1600 errors are generated and dumped to the
log/console. The feed parses fine with the recover option, but the
options for suppressing the messages don't appear to work.
Here's my code:
html = Downloader.get(url).clean_html
options = LibXML::XML::HTMLParser::Options::RECOVER |
          LibXML::XML::HTMLParser::Options::NOERROR |
          LibXML::XML::HTMLParser::Options::NOWARNING
parser = LibXML::XML::HTMLParser.string(html, :options => options)
page = parser.parse.root
Is this the right way to declare the options for suppressing errors and
warnings? Why aren't they actually suppressed? I wouldn't worry too much
about this normally, but after some profiling it appears that
printing and handling these errors takes about 20-40% of my total
request time in my Rails app, so it's a fairly large performance issue.