On Sep 24, 2011, at 5:00 AM, xml-requ...@gnome.org wrote: >> "should be able to parse "real world" HTML, even if severely broken from a >> specification point of view" > > It's documented, therefore it can be called a feature, not a bug :) > > Csaba
Sounds like a cop-out. 1. What is the point of parsing HTML if it is treated like dumb XML without an intelligent spec-aware layer on top of that (at least optionally)? 2. What alternative do people have to libXML if they want to generate a “real” DOM from HTML? 3. Why document an “HTML” mode if it isn’t really parsing HTML? Per HTML, there really ought to be a TBODY node, no ifs, ands, or buts. ajh _______________________________________________ xml mailing list, project page http://xmlsoft.org/ xml@gnome.org http://mail.gnome.org/mailman/listinfo/xml