[
https://issues.apache.org/jira/browse/TIKA-1119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13657751#comment-13657751
]
Nick Burch commented on TIKA-1119:
----------------------------------
If you open the file and do a save-as, does that fix it? (Just wondering if
it's a duff file that office can silently ignore the errors on, or a valid
picture that HSLF is messing up the reading of)
> HSLFExtractor throws if PictureData is not readable
> ---------------------------------------------------
>
> Key: TIKA-1119
> URL: https://issues.apache.org/jira/browse/TIKA-1119
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.3
> Environment: MAC and Ubuntu server tested
> Reporter: Lee Graber
>
> Unfortunately the repro file contains customer sensitive information and
> modifying it has eliminated the repro.
> In handleSlideEmbeddedPictures, the pic.getData() call can throw (in my case
> I got "javax.imageio.IIOException: Error reading PNG image data"). Ideally
> the parser would not be causing this but should this cause the whole parsing
> stage to fail? The file itself opens fine in Office.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira