Seva Alekseyev created TIKA-2196:
------------------------------------

             Summary: IllegalArgumentException on a valid Excel file
                 Key: TIKA-2196
                 URL: https://issues.apache.org/jira/browse/TIKA-2196
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.14
         Environment: Windows 7 x64, JVM 1.8.0_101
            Reporter: Seva Alekseyev
         Attachments: 2007 Experiment watch.xls

On the attached Excel file, which opens fine in Excel, Tika throws the 
following error:

java.lang.IllegalArgumentException: Cannot format given Object as a Number
        at java.text.DecimalFormat.format:-1
        at org.apache.poi.ss.usermodel.ExcelGeneralNumberFormat.format:67
        at java.text.Format.format:-1
        at org.apache.poi.ss.usermodel.DataFormatter.performDateFormatting:736
        at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:804
        at org.apache.poi.ss.usermodel.DataFormatter.formatRawCellContents:785
        at 
org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.formatNumberDateCell:143
        at 
org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener$TikaFormatTrackingHSSFListener.formatNumberDateCell:633
        at 
org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.internalProcessRecord:405
        at 
org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processRecord:336
        at 
org.apache.poi.hssf.eventusermodel.FormatTrackingHSSFListener.processRecord:92
        at org.apache.poi.hssf.eventusermodel.HSSFRequest.processRecord:109
        at 
org.apache.poi.hssf.eventusermodel.HSSFEventFactory.genericProcessEvents:179
        at org.apache.poi.hssf.eventusermodel.HSSFEventFactory.processEvents:136
        at 
org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile:312
        at org.apache.tika.parser.microsoft.ExcelExtractor.parse:169
        at org.apache.tika.parser.microsoft.OfficeParser.parse:177
        at org.apache.tika.parser.microsoft.OfficeParser.parse:130




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to