https://bugs.freedesktop.org/show_bug.cgi?id=77656

Maxim Monastirsky <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |NEW
                 CC|                            |[email protected]
            Summary|FILEOPEN DOC: <!DOCTYPE     |FILEOPEN: HTML with
                   |html>                       |<!DOCTYPE html> and UTF-8
                   |                            |BOM isn't detected

--- Comment #4 from Maxim Monastirsky <[email protected]> ---
@Julien: Ivan probably expects that it will open as HTML document, not as a
Writer document showing the HTML code. In that case I can confirm this bug with
4.2 branch. Fortunately it's fixed in master by the changes I've made to the
HTML detection there. But unfortunately it's a big change and unlikely to be
backported to 4.2.

If someone wants to work on a fix for 4.2: The problem here is that this file
begins with a UTF-8 BOM (http://en.wikipedia.org/wiki/Byte_order_mark#UTF-8),
but HTMLParser::IsHTMLFormat (in svtools/source/svhtml/parhtml.cxx) doesn't
respect that kind of BOM, only the UTF-16 one. We need to simply skip it, the
same way we do for UTF-16.

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to