https://bugs.documentfoundation.org/show_bug.cgi?id=162186

            Bug ID: 162186
           Summary: FILEOPEN DOC html file with 3 or 4 consecutive "é"
                    chararcter
           Product: LibreOffice
           Version: 7.3.7.2 release
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Writer
          Assignee: [email protected]
          Reporter: [email protected]

Description:
I create a ".doc" file with html content, UTF-8 charset defined.
If the file contains 1 or 2 consecutive "é" character, LibreOffice open it
correctly.
But if there is 3 or 4 consecutive "é", the html parsing seem to fail and
LibreOffice display the raw html.
With 5 consecutive "é", it's back to normal.

Steps to Reproduce:
1. With a text editor, create an html with 3 consecutive "é" in the body
2. Save the document with ".doc" extension
3. Open with LibreOffice

Actual Results:
Display raw html

Expected Results:
Parse the html and convert it


Reproducible: Always


User Profile Reset: No

Additional Info:
Version: 7.3.7.2 / LibreOffice Community
Build ID: 30(Build:2)
CPU threads: 4; OS: Linux 5.19; UI render: default; VCL: gtk3
Locale: en-US (en_US.UTF-8); UI: en-US
Ubuntu package version: 1:7.3.7-0ubuntu0.22.04.3
Calc: threaded

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to