On Fri, Sep 23, 2011 at 9:05 AM, Glen Hein wrote: > On Fri, 2011-09-23 at 08:44 +0200, Ralf Junker wrote: > > On 23.09.2011 08:21, Alex Bligh wrote: > >> libxml parses XML not HTML. > > Wrong. libxml parses XML _and_ HTML. Documented here: > > http://www.xmlsoft.org/html/libxml-HTMLparser.html > > Yes, but libxml doesn't claim to be the world's best html parser: > > "should be able to parse "real world" HTML, even if severely broken from a > specification point of view"
It's documented, therefore it can be called a feature, not a bug :) Csaba -- GCS a+ e++ d- C++ ULS$ L+$ !E- W++ P+++$ w++$ tv+ b++ DI D++ 5++ The Tao of math: The numbers you can count are not the real numbers. Life is complex, with real and imaginary parts. "Ok, it boots. Which means it must be bug-free and perfect. " -- Linus Torvalds "People disagree with me. I just ignore them." -- Linus Torvalds _______________________________________________ xml mailing list, project page http://xmlsoft.org/ xml@gnome.org http://mail.gnome.org/mailman/listinfo/xml