https://bugs.freedesktop.org/show_bug.cgi?id=80149
Priority: medium
Bug ID: 80149
Assignee: [email protected]
Summary: FILEOPEN: Control characters in imported CSV files
cause problems once saved as xlsx
Severity: normal
Classification: Unclassified
OS: Windows (All)
Reporter: [email protected]
Hardware: Other
Whiteboard: BSA
Status: UNCONFIRMED
Version: 4.2.4.2 release
Component: Spreadsheet
Product: LibreOffice
Created attachment 101245
--> https://bugs.freedesktop.org/attachment.cgi?id=101245&action=edit
minimal csv file containing a 0x16 character
Problem description:
Steps to reproduce:
1. Import a CSV file that includes a low-value control character (e.g. 0x16)
2. Save resulting spreadsheet in xlsx format
3. Attempt to re-open spreadsheet in LibreOffice or Excel
Current behavior:
The resulting .xlsx file is treated as corrupt (invalid UTF8) by Excel.
LibreOffice truncates columns after the corrupted cell when the xlsx file is
reloaded.
Expected behavior:
CSV import filter could reject the file; could strip out control characters or
XLSX export could use an encoding that coped with the corrupt characters.
Although the initial csv file is obviously malformed, such corrupt data exists
in the wild and is hard to detect. LibreOffice's silent truncation of columns
after the corrupt cell is problematic as it can easily be overlooked.
(in my original example rows after the corrupt cell stayed but with right-hand
columns missing - no error was reported by LibreOffice)
Operating System: Windows 7
Version: 4.2.4.2 release
--
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs