https://bugs.freedesktop.org/show_bug.cgi?id=43337
Bug #: 43337
Summary: The XML structure of ODF imports from DOC adds too
much noise
Classification: Unclassified
Product: LibreOffice
Version: LibO 3.4.1 release
Platform: All
OS/Version: All
Status: UNCONFIRMED
Severity: major
Priority: medium
Component: Libreoffice
AssignedTo: [email protected]
ReportedBy: [email protected]
Created attachment 53951
--> https://bugs.freedesktop.org/attachment.cgi?id=53951
original doc (Word) and converted odt (3.3.4/3.4.1/3.4.4) and docx (Word)
An ODF import of a DOC file (created with the LO top page) produced the
following structure in LO 3.3:
<text:p text:style-name="P3">Home of the LibreOffice Productivity
Suite</text:p>
In LO 3.4.1 (and 3.4.4) it produces:
<text:p text:style-name="P3">Home<text:span text:style-name="T1">
</text:span>of<text:span text:style-name="T1"> </text:span>the<text:span
text:style-name="T1"> </text:span>LibreOffice<text:span text:style-name="T1">
</text:span>Productivity<text:span text:style-name="T1">
</text:span>Suite</text:p>
which basically adds a style to every word and every space.
Although the user might not see the difference, processes that rely on the XML
structure of the document end up with having to deal with all that noise.
For exemple, L10N/translation software (either commercial or free) usually
relies on the XML structure of the document to create a similarly structured
translated file. If there is too much XML noise in the document the translator
(or parser) will not be able to properly handle the contents and may end up
loosing data or creating a invalid XML document.
--
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs