https://bugs.freedesktop.org/show_bug.cgi?id=43337

             Bug #: 43337
           Summary: The XML structure of ODF imports from DOC adds too
                    much noise
    Classification: Unclassified
           Product: LibreOffice
           Version: LibO 3.4.1 release
          Platform: All
        OS/Version: All
            Status: UNCONFIRMED
          Severity: major
          Priority: medium
         Component: Libreoffice
        AssignedTo: [email protected]
        ReportedBy: [email protected]


Created attachment 53951
  --> https://bugs.freedesktop.org/attachment.cgi?id=53951
original doc (Word) and converted odt (3.3.4/3.4.1/3.4.4) and docx (Word)

An ODF import of a DOC file (created with the LO top page) produced the
following structure in LO 3.3:

<text:p text:style-name="P3">Home of the LibreOffice Productivity
Suite</text:p>

In LO 3.4.1 (and 3.4.4) it produces:

<text:p text:style-name="P3">Home<text:span text:style-name="T1">
</text:span>of<text:span text:style-name="T1"> </text:span>the<text:span
text:style-name="T1"> </text:span>LibreOffice<text:span text:style-name="T1">
</text:span>Productivity<text:span text:style-name="T1">
</text:span>Suite</text:p>

which basically adds a style to every word and every space.

Although the user might not see the difference, processes that rely on the XML
structure of the document end up with having to deal with all that noise.

For exemple, L10N/translation software (either commercial or free) usually
relies on the XML structure of the document to create a similarly structured
translated file. If there is too much XML noise in the document the translator
(or parser) will not be able to properly handle the contents and may end up
loosing data or creating a invalid XML document.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to