[
https://issues.apache.org/jira/browse/TIKA-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antoni Mylka updated TIKA-485:
------------------------------
Attachment: tika-truncated-excel-file.patch
a patch with a test that exposes the issue
> ContainerAwareDetector doesn't support truncated POI files
> ----------------------------------------------------------
>
> Key: TIKA-485
> URL: https://issues.apache.org/jira/browse/TIKA-485
> Project: Tika
> Issue Type: Improvement
> Reporter: Antoni Mylka
> Attachments: tika-truncated-excel-file.patch
>
>
> If a file has a POI magic number but the call to new POIFSFileSystem(new
> FileInputStream(stream.getFile())); throws an exception because the file is
> broken - the entire process will fail. A simple try-catch around the call to
> POIFSContainerDetector.detect would allow the ContainerAwareDetector to
> return a meaningful result
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.