https://bugs.documentfoundation.org/show_bug.cgi?id=100089
Bug ID: 100089
Summary: FILESAVE: Excel files saved with libreoffice growing
to magnitudes bigger, unreadable from python
Product: LibreOffice
Version: 4.2.8.2 release
Hardware: All
OS: All
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: LibreOffice
Assignee: [email protected]
Reporter: [email protected]
Original file:
-rw-rw-r-- 1 mrjb mrjb 435131 May 19 11:37 Grocery Supplier
Portal_0.41jg4_20160519.xlsx
Loaded and saved with 4.2.8.2:
-rw-rw-r-- 1 mrjb mrjb 32855085 May 27 14:46 Grocery Supplier
Portal_0.41libreofficesaveonly.xlsx
Content copied and pasted to a fresh sheet:
-rw-rw-r-- 1 mrjb mrjb 374899 May 27 14:39 Grocery Supplier
Portal_0.41libreofficesavenew.xlsx
The merely saved file which is 100x the size of the original does not read
under Python with either the xlrd or openxlsx libraries; both crash. I even
tried patching xlrd to no avail.
Only after I read online discussion forums I read this may have to do with
empty cells being saved unnecessarily and only then I realized the libreoffice
saved file was 100x as big as the original.
Copying the content from the sheet to a new, fresh sheet and saving that
resulted in a nice small file which subsequently loaded fine in xlrd.
Though I have previously seen this being blamed on the xlrd library, the fact
that the file is rendered unreadable by several python libraries *and* grows to
100x its size merely by opening and saving it, this indicates something is
amiss on the libreoffice side in this version.
Unfortunately since these files contain potentially sensitive customer data I
cannot provide them as example attachment.
--
You are receiving this mail because:
You are the assignee for the bug._______________________________________________
Libreoffice-bugs mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs