https://bugs.documentfoundation.org/show_bug.cgi?id=162186
Bug ID: 162186
Summary: FILEOPEN DOC html file with 3 or 4 consecutive "é"
chararcter
Product: LibreOffice
Version: 7.3.7.2 release
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: Writer
Assignee: [email protected]
Reporter: [email protected]
Description:
I create a ".doc" file with html content, UTF-8 charset defined.
If the file contains 1 or 2 consecutive "é" character, LibreOffice open it
correctly.
But if there is 3 or 4 consecutive "é", the html parsing seem to fail and
LibreOffice display the raw html.
With 5 consecutive "é", it's back to normal.
Steps to Reproduce:
1. With a text editor, create an html with 3 consecutive "é" in the body
2. Save the document with ".doc" extension
3. Open with LibreOffice
Actual Results:
Display raw html
Expected Results:
Parse the html and convert it
Reproducible: Always
User Profile Reset: No
Additional Info:
Version: 7.3.7.2 / LibreOffice Community
Build ID: 30(Build:2)
CPU threads: 4; OS: Linux 5.19; UI render: default; VCL: gtk3
Locale: en-US (en_US.UTF-8); UI: en-US
Ubuntu package version: 1:7.3.7-0ubuntu0.22.04.3
Calc: threaded
--
You are receiving this mail because:
You are the assignee for the bug.