Byron Miller wrote:

Any chance of getting that rolled in? Has a few fixes
that look good.

Did you try using TagSoup? Some time ago I added support to parse-html for using TagSoup instead of NekoHTML (this is an option in the config file). I found that in many cases TagSoup gives much better results, especially for pages with multiple <html> or <body> elements, where Neko would give up...
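
For reference, switching parsers should only need one property override. This is a minimal sketch, assuming the property is named parser.html.impl and accepts the values "neko" and "tagsoup"; neither the name nor the values are stated in this message, so please verify them against nutch-default.xml in your checkout before relying on it. The override would go in nutch-site.xml:

```xml
<!-- Assumed property name and values; check nutch-default.xml. -->
<property>
  <name>parser.html.impl</name>
  <!-- "neko" selects NekoHTML, "tagsoup" selects TagSoup -->
  <value>tagsoup</value>
</property>
```

Keeping the override in the site file rather than editing the defaults means you can switch back to NekoHTML by removing the property.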

Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
Contact: info at sigram dot com
