https://issues.apache.org/bugzilla/show_bug.cgi?id=53380

          Priority: P2
            Bug ID: 53380
          Assignee: [email protected]
           Summary: ArrayIndexOutOfBoundsException parsing Word 97
                    document
          Severity: major
    Classification: Unclassified
          Reporter: [email protected]
          Hardware: Macintosh
            Status: NEW
           Version: unspecified
         Component: HDF
           Product: POI

Created attachment 28901
  --> https://issues.apache.org/bugzilla/attachment.cgi?id=28901&action=edit
offending word doc

An ArrayIndexOutOfBoundsException occurs (stack trace below) when parsing the
attached Word 97 document.


Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@393e6226
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
    at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:133)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:400)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:101)
Caused by: java.lang.ArrayIndexOutOfBoundsException: 18
    at org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
    at org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
    at org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstractType.java:56)
    at org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingDescriptor.java:38)
    at org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOperation(CharacterSprmUncompressor.java:582)
    at org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(CharacterSprmUncompressor.java:65)
    at org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
    at org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121)
    at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
    at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:77)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:185)
    at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:160)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
    ... 5 more
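
The "Caused by" frames suggest that a 4-byte little-endian read ran past the end of the byte array handed to Colorref. The following is only a sketch of that failure mode, not POI code: the class LittleEndianSketch, its methods, the 18-byte buffer and the offset 16 are all assumptions chosen so that the failing index matches the 18 in the trace. The real buffer size and offset in the attached document are not known from this report.

```java
public class LittleEndianSketch {

    // Hypothetical stand-in for an unchecked little-endian 32-bit read.
    // It reads four consecutive bytes starting at offset and throws
    // ArrayIndexOutOfBoundsException if any of them lies past the array end.
    public static int getIntUnchecked(byte[] data, int offset) {
        int b0 = data[offset] & 0xFF;
        int b1 = data[offset + 1] & 0xFF;
        int b2 = data[offset + 2] & 0xFF;
        int b3 = data[offset + 3] & 0xFF;
        return (b3 << 24) | (b2 << 16) | (b1 << 8) | b0;
    }

    // Hypothetical guard: true only if four bytes are available at offset.
    public static boolean hasInt(byte[] data, int offset) {
        return offset >= 0 && offset <= data.length - 4;
    }

    public static void main(String[] args) {
        byte[] buffer = new byte[18]; // assumed size, for illustration only
        int offset = 16;              // assumed offset: only 2 bytes remain

        System.out.println("4 bytes available: " + hasInt(buffer, offset));
        try {
            getIntUnchecked(buffer, offset);
        } catch (ArrayIndexOutOfBoundsException e) {
            // The third byte read is at index 18, one past the last valid index.
            System.out.println("Caught: " + e);
        }
    }
}
```

A guard like hasInt before the read would let a parser skip or report a truncated shading value instead of aborting the whole document. Whether that is the right fix for HWPF depends on how the shading data is laid out in the attached file.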
