Byron Miller wrote:

http://people.apache.org/~andyc/neko/doc/html/changes.html

Any chance of getting that rolled in? Has a few fixes
that look good.

Did you try using TagSoup? Some time ago I added to parse-html the support for using TagSoup instead of NekoHTML (this is an option in the config file). I found that in many cases TagSoup gives much better results, especially for pages with multiple <html> or <body> elements, where neko would give up...

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to