[
https://issues.apache.org/jira/browse/TIKA-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-2841:
------------------------------
Description: We've done some work on this with docx, etc., but we can do
more with EPUB and OpenOffice, and, frankly, MS Office as well. We should also
improve the ContainerDetector to work more robustly with truncated zips. (was:
We've done some work on this with docx, etc., but we can do more with EPUB and
OpenOffice, and, frankly, MS Office as well.)
> Improve robustness of parsers of zip-based files on truncated files
> -------------------------------------------------------------------
>
> Key: TIKA-2841
> URL: https://issues.apache.org/jira/browse/TIKA-2841
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: truncated_10000.zip, truncated_30000.zip
>
>
> We've done some work on this with docx, etc., but we can do more with EPUB and
> OpenOffice, and, frankly, MS Office as well. We should also improve the
> ContainerDetector to work more robustly with truncated zips.
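
For context on why truncation hurts zip-based formats: the central directory sits at the end of a zip archive, so a truncated file usually loses it, while the local file headers at the front stay readable. The sketch below is only an illustration of that idea using the JDK's streaming ZipInputStream. It is not Tika's ContainerDetector code, and the class and method names (TruncatedZipSketch, listReadableEntries, buildSampleZip) are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Random;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration only; not part of Apache Tika.
final class TruncatedZipSketch {

    /**
     * Streams local file headers from the front of the archive and collects
     * entry names until the data runs out. An IOException (typically an
     * EOFException or ZipException) is treated as "truncated here", and the
     * names seen so far are returned instead of failing outright.
     */
    public static List<String> listReadableEntries(InputStream in) {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zis = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zis.getNextEntry()) != null) {
                names.add(entry.getName());
            }
        } catch (IOException e) {
            // Expected on truncated input: keep whatever was read so far.
        }
        return names;
    }

    /** Builds a small in-memory zip: a short first entry, then a large one. */
    public static byte[] buildSampleZip() {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(bos)) {
            zos.putNextEntry(new ZipEntry("mimetype"));
            zos.write("application/epub+zip".getBytes(StandardCharsets.US_ASCII));
            zos.closeEntry();

            // Pseudo-random bytes compress poorly, so this entry stays large.
            byte[] payload = new byte[10_000];
            new Random(42).nextBytes(payload);
            zos.putNextEntry(new ZipEntry("content.xml"));
            zos.write(payload);
            zos.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) {
        byte[] full = buildSampleZip();
        // Simulate truncation by keeping only the first half of the bytes.
        byte[] truncated = Arrays.copyOf(full, full.length / 2);
        System.out.println(listReadableEntries(new ByteArrayInputStream(truncated)));
    }
}
```

Entry names recovered this way (for example a leading "mimetype" entry in EPUB or OpenDocument files) could still give a detector something to work with when the end of the file is missing. Whether that fits this issue is an assumption, not something stated in the description above.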