Seva Alekseyev created TIKA-2135:
------------------------------------

             Summary: "msofbtOPT can have only '0x3' version" on an Excel file
                 Key: TIKA-2135
                 URL: https://issues.apache.org/jira/browse/TIKA-2135
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.13
         Environment: Windows 7 x64, JVM 1.8.0_101
            Reporter: Seva Alekseyev
         Attachments: MAPSTATS.XLS

The attached file opens fine in Excel, but the Tika parser throws the
following exception on it:

java.lang.IllegalArgumentException: msofbtOPT can have only '0x3' version
        at org.apache.poi.ddf.EscherOptRecord.setVersion(EscherOptRecord.java:70)
        at org.apache.poi.ddf.EscherRecord.setOptions(EscherRecord.java:136)
        at org.apache.poi.ddf.DefaultEscherRecordFactory.createRecord(DefaultEscherRecordFactory.java:108)
        at org.apache.poi.ddf.EscherContainerRecord.fillFields(EscherContainerRecord.java:77)
        at org.apache.poi.hssf.record.AbstractEscherHolderRecord.convertToEscherRecords(AbstractEscherHolderRecord.java:82)
        at org.apache.poi.hssf.record.AbstractEscherHolderRecord.decode(AbstractEscherHolderRecord.java:261)
        at org.apache.tika.parser.microsoft.ExcelExtractor$TikaHSSFListener.processFile(ExcelExtractor.java:320)
        at org.apache.tika.parser.microsoft.ExcelExtractor.parse(ExcelExtractor.java:169)
        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:154)
        at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:117)
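
For context on the message itself, here is a rough sketch of the check it
implies. It is not POI's code: the class and method names are hypothetical,
and it assumes the usual Escher record header layout (a 16-bit options field
with the version in the low 4 bits and the instance in the upper 12 bits,
then a 16-bit record type and a 32-bit length, all little-endian). The sample
bytes are made up for illustration and are not taken from MAPSTATS.XLS.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical helper for illustration only; not part of Apache POI or Tika.
final class EscherHeaderSketch {
    // Record type id of msofbtOPT.
    static final int MSOFBT_OPT = 0xF00B;

    // The low 4 bits of the options field hold the record version.
    static int version(int options) {
        return options & 0x000F;
    }

    // The upper 12 bits of the options field hold the record instance.
    static int instance(int options) {
        return (options & 0xFFF0) >>> 4;
    }

    // Mirrors the rule named in the exception: an msofbtOPT record
    // is expected to carry version 0x3; other record types are not checked here.
    static boolean hasExpectedOptVersion(int type, int options) {
        return type != MSOFBT_OPT || version(options) == 0x3;
    }

    // Decodes an 8-byte little-endian record header into a readable summary.
    static String describe(byte[] header) {
        ByteBuffer buf = ByteBuffer.wrap(header).order(ByteOrder.LITTLE_ENDIAN);
        int options = buf.getShort() & 0xFFFF;
        int type = buf.getShort() & 0xFFFF;
        int length = buf.getInt();
        return String.format("type=0x%04X version=0x%X instance=%d length=%d",
                type, version(options), instance(options), length);
    }

    public static void main(String[] args) {
        // Made-up header: options 0x0033, type 0xF00B, length 18.
        byte[] sample = {0x33, 0x00, 0x0B, (byte) 0xF0, 0x12, 0x00, 0x00, 0x00};
        System.out.println(describe(sample));
    }
}
```

A header whose options field decodes to any version other than 0x3 for
record type 0xF00B would fail this check, which is presumably what happens
somewhere in the drawing data of the attached file.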



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)