https://bugs.freedesktop.org/show_bug.cgi?id=76021

--- Comment #8 from Tomaz Vajngerl <[email protected]> ---
I agree that HTML export in LO is really bad. It hasn't been worked on since
Netscape was king, and it probably needs rewriting to make better use of CSS
and SVG, to stop using deprecated HTML features, and to use new HTML5 tags
where appropriate (with an easy choice between HTML4 and HTML5). This will
probably take some time.

However, if you are trying to parse HTML with an XML parser, then it is your
own fault. HTML is not XML. There are subtle differences: tags are case
sensitive in XML but not in HTML; HTML needs no "/" when an element has no
body (for example, <br> is valid HTML but not XML); and HTML parsers tolerate
improperly nested tags. In other words, writing HTML as well-formed XML is
recommended today but not mandated, so you cannot rely on it.
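
To illustrate the difference, here is a minimal sketch using only the Python
standard library. The snippet text and the helper names (parses_as_xml,
html_start_tags, TagCollector) are made up for this example and have nothing
to do with the LO export code itself. It feeds the same markup to an XML
parser and to an HTML parser.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical markup: upper-case tag name and a <br> without a closing "/".
HTML_SNIPPET = "<P>first line<br>second line</P>"
# The same content written as well-formed XML, as XHTML would require.
XHTML_SNIPPET = "<p>first line<br/>second line</p>"


class TagCollector(HTMLParser):
    """Records the name of every start tag the HTML parser reports."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names already converted to lower case.
        self.tags.append(tag)


def parses_as_xml(text):
    """Return True if an XML parser accepts the text, False otherwise."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


def html_start_tags(text):
    """Return the start tags an HTML parser finds in the text."""
    collector = TagCollector()
    collector.feed(text)
    collector.close()
    return collector.tags


if __name__ == "__main__":
    print(parses_as_xml(HTML_SNIPPET))    # False: <br> is never closed
    print(html_start_tags(HTML_SNIPPET))  # ['p', 'br']
    print(parses_as_xml(XHTML_SNIPPET))   # True
```

The HTML parser accepts both snippets, while the XML parser rejects the first
one because of the unclosed <br>.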

If you want a valid XML document, export it as XHTML, which is actually based
on XML.

