Howdy folks, running into an issue extracting text from an .xlsx file.

I've tried both my own extractor and the built-in XSSFEventBasedExcelExtractor.
In either case the output contains only data from number or date columns; all
character/string columns come back null.  Example code using the latter:

    public static void main(String[] args)
            throws XmlException, OpenXML4JException, IOException
    {
        XSSFEventBasedExcelExtractor e =
                new XSSFEventBasedExcelExtractor("F:\\test_file.xlsx");

        String a = e.getText();

        System.out.println(a);
    }

Output is similar to:
400403  11/16/10

The file has 49 columns with a lot of other text/character data.

I've used both the most recent POI 3.7 release and POI 3.7-beta1.

Interestingly, when I open the file in Excel 2007, save it without making any
changes, and rerun the extraction utilities, they work fine.  I encountered
something similar in HSSF with older versions of Excel files.  Could this be a
similar problem?

Has anyone else seen this issue?

I'm unable to upload the problem Excel file as it has client data.  The file is
also enormous (270MB when I unzip the .xlsx).  To help troubleshoot this issue,
would it be possible for me to send a portion of the extracted OOXML data
instead, and if so, which portion do you need?
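
In case it helps with picking an excerpt, here is a rough sketch of how the
package could be inspected without going through POI. It relies only on the
fact that an .xlsx is a zip archive and uses the standard java.util.zip
classes. The class name XlsxPeek, the helper names, the 2000-byte limit, and the
part name "xl/worksheets/sheet1.xml" are assumptions made for illustration; the
actual part names in my file may differ, which is what the listing is meant to
reveal.

```java
import java.io.ByteArrayOutputStream;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

public class XlsxPeek {

    /** Returns the name of every part (zip entry) in the package, in stored order. */
    public static List<String> listParts(InputStream xlsx) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(xlsx)) {
            for (ZipEntry entry; (entry = zip.getNextEntry()) != null; ) {
                names.add(entry.getName());
            }
        }
        return names;
    }

    /** Returns up to maxBytes from the start of the named part, or null if it is absent. */
    public static String head(InputStream xlsx, String partName, int maxBytes)
            throws IOException {
        try (ZipInputStream zip = new ZipInputStream(xlsx)) {
            for (ZipEntry entry; (entry = zip.getNextEntry()) != null; ) {
                if (!entry.getName().equals(partName)) {
                    continue;
                }
                ByteArrayOutputStream out = new ByteArrayOutputStream();
                byte[] buf = new byte[4096];
                int remaining = maxBytes;
                int n;
                // Stop at the byte limit or at the end of this entry, whichever comes first.
                while (remaining > 0
                        && (n = zip.read(buf, 0, Math.min(buf.length, remaining))) > 0) {
                    out.write(buf, 0, n);
                    remaining -= n;
                }
                // Cutting at a byte limit can split a multi-byte character at the very end.
                return new String(out.toByteArray(), StandardCharsets.UTF_8);
            }
        }
        return null;
    }

    public static void main(String[] args) throws IOException {
        String path = args.length > 0 ? args[0] : "F:\\test_file.xlsx";
        try (InputStream in = new FileInputStream(path)) {
            for (String name : listParts(in)) {
                System.out.println(name);
            }
        }
        // Hypothetical part name; pick one from the listing above.
        try (InputStream in = new FileInputStream(path)) {
            System.out.println(head(in, "xl/worksheets/sheet1.xml", 2000));
        }
    }
}
```

Running the same sketch against the original file and against the copy
re-saved from Excel 2007 may show where the two packages differ, and the head
of a worksheet part is small enough to scrub of client data before sending.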

Thanks for any help,
-Matt
