Seva Alekseyev created TIKA-2185:
------------------------------------
Summary: NegativeArraySizeException on a valid Word file
Key: TIKA-2185
URL: https://issues.apache.org/jira/browse/TIKA-2185
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.14
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: PatentW final.doc
On the attached document, which opens fine with Word, the Tika parser throws
the following:
java.lang.NegativeArraySizeException:
at org.apache.poi.hwpf.model.StyleDescription.<init>:122
at org.apache.poi.hwpf.model.StyleSheet.<init>:107
at org.apache.poi.hwpf.HWPFDocument.<init>:289
at org.apache.tika.parser.microsoft.WordExtractor.parse:151
at org.apache.tika.parser.microsoft.OfficeParser.parse:169
at org.apache.tika.parser.microsoft.OfficeParser.parse:130
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)