Seva Alekseyev created TIKA-2213:
------------------------------------
Summary: ArrayIndexOutOfBoundsException on a valid Word file
Key: TIKA-2213
URL: https://issues.apache.org/jira/browse/TIKA-2213
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.14
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
On the attached file, which opens in Word, Tika parser throws the following
error:
java.lang.ArrayIndexOutOfBoundsException:
at java.lang.System.arraycopy:-2
at org.apache.poi.hwpf.model.TextPieceTable.<init>:109
at org.apache.poi.hwpf.model.ComplexFileTable.<init>:70
at org.apache.poi.hwpf.HWPFOldDocument.<init>:68
at org.apache.tika.parser.microsoft.WordExtractor.parseWord6:642
at org.apache.tika.parser.microsoft.WordExtractor.parse:153
at org.apache.tika.parser.microsoft.OfficeParser.parse:169
at org.apache.tika.parser.microsoft.OfficeParser.parse:130
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)