https://bugs.freedesktop.org/show_bug.cgi?id=77656
Maxim Monastirsky <[email protected]> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |NEW CC| |[email protected] Summary|FILEOPEN DOC: <!DOCTYPE |FILEOPEN: HTML with |html> |<!DOCTYPE html> and UTF-8 | |BOM isn't detected --- Comment #4 from Maxim Monastirsky <[email protected]> --- @Julien: Ivan probably expects that it will open as HTML document, not as a Writer document showing the HTML code. In that case I can confirm this bug with 4.2 branch. Fortunately it's fixed in master by the changes I've made to the HTML detection there. But unfortunately it's a big change and unlikely to be backported to 4.2. If someone wants to work on a fix for 4.2: The problem here is that this file begins with a UTF-8 BOM (http://en.wikipedia.org/wiki/Byte_order_mark#UTF-8), but HTMLParser::IsHTMLFormat (in svtools/source/svhtml/parhtml.cxx) doesn't respect that kind of BOM, only the UTF-16 one. We need to simply skip it, the same way we do for UTF-16. -- You are receiving this mail because: You are the assignee for the bug.
_______________________________________________ Libreoffice-bugs mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
