https://bugs.documentfoundation.org/show_bug.cgi?id=98965
Bug ID: 98965
Summary: FILESAVE: Conversion from xls to xlsx fails due to
weird character inside a cell
Product: LibreOffice
Version: 5.0.5.2 release
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: UNCONFIRMED
Severity: normal
Priority: medium
Component: Calc
Assignee: [email protected]
Reporter: [email protected]
I have been trying to convert an xls table to an xlsx or csv table, but the
conversion fails without providing a proper error message. I believe the
issue is caused by a weird character inside a cell of the xls table.
Interestingly, conversion to html works fine.
Steps to reproduce the problem:
1) Download problematic xls table:
# wget --user-agent="" \
http://www.sciencedirect.com/science/MiamiMultiMediaURL/1-s2.0-S0140673614617050/1-s2.0-S0140673614617050-mmc2.xls/271074/html/S0140673614617050/8362fd82294e4f65e12123ba7f198387/mmc2.xls
2) Convert table to html:
# localc --convert-to html mmc2.xls
3) Identify problematic cell (sheet 5, row 129, column 9):
# grep "Nephrotic syndrome, type 2 ." mmc2.xls
# grep "Nephrotic syndrome, type 2 ." mmc2.html
4) Convert table to xlsx:
# localc --convert-to xlsx mmc2.xls
5) Open converted table:
# localc mmc2.xlsx
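To see which bytes actually follow the text matched in step 3, a small Python sketch along these lines might help. The function name find_context is made up for illustration, and the sketch assumes the cell text is stored in the xls file either as 8-bit text or as UTF-16LE, which has not been verified for this file.

```python
def find_context(data: bytes, needle: str, after: int = 8):
    """Return (offset, encoding, trailing_bytes) for every occurrence of needle.

    The needle is searched for in two encodings, on the assumption that the
    xls file stores cell text either as 8-bit text or as UTF-16LE.
    """
    hits = []
    for encoding in ("latin-1", "utf-16-le"):
        pattern = needle.encode(encoding)
        start = data.find(pattern)
        while start != -1:
            end = start + len(pattern)
            # Keep the raw bytes that come right after the matched text.
            hits.append((start, encoding, data[end:end + after]))
            start = data.find(pattern, start + 1)
    return hits


# Example use against the downloaded file:
#   with open("mmc2.xls", "rb") as fh:
#       for offset, encoding, tail in find_context(fh.read(), "Nephrotic syndrome, type 2 "):
#           print(offset, encoding, tail.hex())
```

The hex dump of the trailing bytes should show whether the character before the final "." is a control character or something else.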
You will notice that in sheet 5, columns 4 and 9 are missing all data after
row 129. The converted file itself also appears to be malformed, as the
following command:
# xlsx2csv mmc2.xlsx
fails with the error "xml.parsers.expat.ExpatError: not well-formed (invalid
token): line 2, column 48431"
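An "invalid token" error from expat would be consistent with a character that XML 1.0 does not allow having been written into one of the XML parts of the xlsx file. That is only a guess at the cause. The following Python sketch scans every .xml member of the archive for such characters; the function names are made up for illustration, and the reported positions are character-based, so they may not line up exactly with the column expat prints.

```python
import re
import zipfile

# Characters that XML 1.0 does not allow in a document: C0 controls other
# than tab, LF and CR, plus the noncharacters U+FFFE and U+FFFF.
ILLEGAL_XML_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\ufffe\uffff]")


def find_illegal_chars(text: str):
    """Return (line, column, code_point) for each illegal character.

    Line and column are both 1-based and counted in characters.
    """
    found = []
    for match in ILLEGAL_XML_CHARS.finditer(text):
        pos = match.start()
        line = text.count("\n", 0, pos) + 1
        column = pos - (text.rfind("\n", 0, pos) + 1) + 1
        found.append((line, column, ord(match.group())))
    return found


def scan_xlsx(path: str):
    """Yield (member_name, line, column, code_point) for an xlsx file."""
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            if not name.endswith(".xml"):
                continue
            # Undecodable bytes are replaced so the scan can continue.
            text = archive.read(name).decode("utf-8", errors="replace")
            for line, column, code_point in find_illegal_chars(text):
                yield name, line, column, code_point


# Example use against the converted file:
#   for name, line, column, code_point in scan_xlsx("mmc2.xlsx"):
#       print(f"{name}: line {line}, column {column}, U+{code_point:04X}")
```

If the scan reports a hit in the worksheet part for sheet 5, that would support the suspicion that the character from step 3 is copied into the xlsx output without being escaped or dropped.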