[ https://issues.apache.org/jira/browse/TIKA-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15247637#comment-15247637 ]
Tim Allison commented on TIKA-1956:
-----------------------------------
Thank you for raising this issue and supplying a triggering document!
We were assuming that POI would always return a non-null value from
{{field.getMarkEndCharacterRun(r)}}, and we called {{.getPicOffset()}} on the
result to label the attachment. For this document that call returns null,
hence the NPE.
I added a null check, and the full document is now parsed. I'll commit the
fix shortly.
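For anyone following along, here is a minimal sketch of the kind of null check described above. It is not the actual commit: the {{Field}} and {{CharacterRun}} interfaces below are hypothetical stand-ins that only mirror the two POI calls named in this comment, and the {{attachmentLabel}} helper, its "_" prefix, and the class name are assumptions made purely for illustration.

```java
public class PicOffsetNullCheckSketch {

    /** Hypothetical stand-in for POI's CharacterRun; only the call named above. */
    public interface CharacterRun {
        int getPicOffset();
    }

    /** Hypothetical stand-in for POI's Field; may return null for some documents. */
    public interface Field {
        CharacterRun getMarkEndCharacterRun(Object range);
    }

    /**
     * Builds an attachment label from the picture offset, or returns null
     * when no end-mark character run is available (the case that used to
     * trigger the NPE). The "_" prefix is an assumption for illustration.
     */
    public static String attachmentLabel(Field field, Object range) {
        CharacterRun endRun = field.getMarkEndCharacterRun(range);
        if (endRun == null) {
            // No run to read the offset from: skip the label, keep parsing.
            return null;
        }
        return "_" + endRun.getPicOffset();
    }

    public static void main(String[] args) {
        Field withRun = range -> () -> 1234;   // run present
        Field withoutRun = range -> null;      // run missing, as in this issue
        System.out.println(attachmentLabel(withRun, null));
        System.out.println(attachmentLabel(withoutRun, null));
    }
}
```

With a check along these lines, a missing character run costs only the attachment label; the parser no longer aborts the rest of the document.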
Thank you, again.
> NPE in WordParser when trying to getPicOffset
> ---------------------------------------------
>
> Key: TIKA-1956
> URL: https://issues.apache.org/jira/browse/TIKA-1956
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.11
> Environment: Ubuntu 14.04, tika-server-1.11.jar,
> Reporter: Ramit Wadhwa
> Assignee: Tim Allison
>
> Tika-server returns a 422 error:
> /rmeta fails with a 422 error,
> /tika returns text, but it is only partial.
> The text is parsed only up to the beginning of an image.
> This is the last text that is parsed:
> <h4>17.5.7.1
> BM-SC Initiated Multicast Service Deactivation
> </h4>
> <p>