https://bugs.documentfoundation.org/show_bug.cgi?id=98965

            Bug ID: 98965
           Summary: FILESAVE: Conversion from xls to xlsx fails due to
                    weird character inside a cell
           Product: LibreOffice
           Version: 5.0.5.2 release
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Calc
          Assignee: [email protected]
          Reporter: [email protected]

I have been trying to convert an xls table to an xlsx or csv table, but the
conversion fails without even providing a proper error message. I believe the
issue is due to a weird character inside a cell of the xls table.
Interestingly, the conversion to html works fine instead.

Steps to reproduce the problem:
1) Download problematic xls table:
# wget --user-agent=""
http://www.sciencedirect.com/science/MiamiMultiMediaURL/1-s2.0-S0140673614617050/1-s2.0-S0140673614617050-mmc2.xls/271074/html/S0140673614617050/8362fd82294e4f65e12123ba7f198387/mmc2.xls
2) Convert table to html:
# localc --convert-to html mmc2.xls
3) Identify problematic cell (sheet 5, row 129, column 9):
# grep "Nephrotic syndrome, type 2 ." mmc2.xls
# grep "Nephrotic syndrome, type 2 ." mmc2.html
4) Convert table to xlsx:
# localc --convert-to xlsx mmc2.xls
5) Open converted table
# localc mmc2.xlsx

You will notice that in sheet 5 column 4 and column 9 after row 129 are all
missing data. The file also seems to be malformed itself, as the following
command:
# xlsx2csv mmc2.xlsx
Fails with error "xml.parsers.expat.ExpatError: not well-formed (invalid
token): line 2, column 48431"

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to