https://bz.apache.org/bugzilla/show_bug.cgi?id=59746

            Bug ID: 59746
           Summary: Unable to correctly read data in XLSX file
           Product: POI
           Version: 3.14-FINAL
          Hardware: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: XSSF
          Assignee: [email protected]
          Reporter: [email protected]

I have an excel xlsx file that was exported by some mass spec software
(ThemoScientific I think) that looks to contain hierarchical or grouped data.
There are two examples of these files in the attached, one that is the full
export and one where only the top level data is exported. If you try to open
these files using POI, the file opens correctly, but the lastRowNum for the
only sheet is -1 (example code is also in the attachment). However, excel can
open the file and if you immediately hit save on the file (no changes needed)
and then try to open with POI, the file opens and the correct number of rows
are found. 

Looking at the XLM for the saved file, there are definitely changes that excel
has made, but I am not sure where POI is failing in parsing the original XML.
Not sure where the problem lies, but I have found a python package
(https://pypi.python.org/pypi/xlrd) that will correctly read the files that POI
is unable to read, so the XML does not appear to be completely nonstandard (or
only one that excel itself can read). 

This appears still be a problem with 3.15-beta1 as well.

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to