https://issues.apache.org/bugzilla/show_bug.cgi?id=49933
Summary: Word 6/95 documents with sections cause
ArrayIndexOutOfBoundsException
Product: POI
Version: 3.7-dev
Platform: PC
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: HWPF
AssignedTo: [email protected]
ReportedBy: [email protected]
Created an attachment (id=26027)
--> (https://issues.apache.org/bugzilla/attachment.cgi?id=26027)
Word 95 document with a section
Processing a Word 6/Word 95 document that contains sections causes an
ArrayIndexOutOfBoundsException. Running Tika (Revision: 997224, 2010-09-14)
with the POI 3.7-beta2 dependency on the attached document produces:
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.officepar...@1e7c5cb
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:165)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:146)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:197)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:71)
Caused by: java.lang.ArrayIndexOutOfBoundsException: 22
    at org.apache.poi.util.LittleEndian.getShort(LittleEndian.java:46)
    at org.apache.poi.hwpf.sprm.SprmOperation.<init>(SprmOperation.java:54)
    at org.apache.poi.hwpf.sprm.SprmIterator.next(SprmIterator.java:45)
    at org.apache.poi.hwpf.sprm.SectionSprmUncompressor.uncompressSEP(SectionSprmUncompressor.java:36)
    at org.apache.poi.hwpf.model.SEPX.<init>(SEPX.java:33)
    at org.apache.poi.hwpf.model.OldSectionTable.<init>(OldSectionTable.java:61)
    at org.apache.poi.hwpf.HWPFOldDocument.<init>(HWPFOldDocument.java:103)
    at org.apache.poi.hwpf.HWPFOldDocument.<init>(HWPFOldDocument.java:42)
    at org.apache.tika.parser.microsoft.WordExtractor.parseWord6(WordExtractor.java:150)
    at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:51)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:187)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:163)
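
The innermost frames of the trace suggest a two-byte little-endian read that reached index 22 of a buffer too short to hold it. The sketch below is only an illustration of that failure mode and of a bounds check that would avoid it; it is not POI's code. The class LittleEndianReadSketch and both of its methods are hypothetical names, and the 22-byte buffer length is an assumption chosen to match the index in the exception message.

```java
// Hypothetical illustration only; not the POI implementation.
final class LittleEndianReadSketch {

    // Reads a 16-bit little-endian value without checking bounds.
    // Throws ArrayIndexOutOfBoundsException if offset + 1 >= data.length.
    static short getShortUnchecked(byte[] data, int offset) {
        int b0 = data[offset] & 0xFF;      // low byte
        int b1 = data[offset + 1] & 0xFF;  // high byte
        return (short) ((b1 << 8) | b0);
    }

    // Returns true only if two bytes are available starting at offset.
    static boolean canReadShort(byte[] data, int offset) {
        return offset >= 0 && offset <= data.length - 2;
    }

    public static void main(String[] args) {
        byte[] buf = new byte[22]; // assumed length, valid indices 0..21

        System.out.println(canReadShort(buf, 20)); // two bytes left
        System.out.println(canReadShort(buf, 21)); // only one byte left

        try {
            // The second byte would be read from index 22.
            getShortUnchecked(buf, 21);
        } catch (ArrayIndexOutOfBoundsException e) {
            System.out.println("out of bounds: " + e.getMessage());
        }
    }
}
```

A caller that iterates over variable-length records could test canReadShort before each read and stop, or report a malformed record, instead of letting the runtime exception propagate.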