[ 
https://issues.apache.org/jira/browse/TIKA-485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Antoni Mylka updated TIKA-485:
------------------------------

    Attachment: tika-truncated-excel-file.patch

a patch with a test that exposes the issue

> ContainerAwareDetector doesn't support truncated POI files
> ----------------------------------------------------------
>
>                 Key: TIKA-485
>                 URL: https://issues.apache.org/jira/browse/TIKA-485
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Antoni Mylka
>         Attachments: tika-truncated-excel-file.patch
>
>
> If a file has a POI magic number but the call to  new POIFSFileSystem(new 
> FileInputStream(stream.getFile())); throws an exception because the file is 
> broken - the entire process will fail. A simple try-catch around the call to 
> POIFSContainerDetector.detect would allow the ContainerAwareDetector to 
> return a meaningful result

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to