[ 
https://issues.apache.org/jira/browse/TIKA-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-2841:
------------------------------
    Description: We've done some work on this with docx, etc, but we can do 
more with epub and open office, and, frankly msoffice as well.  We should also 
improve the ContainerDetector to work more robustly with truncated zips.  (was: 
We've done some work on this with docx, etc, but we can do more with epub and 
open office, and, frankly msoffice as well.)

> Improve robustness of parsers of zip-based files on truncated files
> -------------------------------------------------------------------
>
>                 Key: TIKA-2841
>                 URL: https://issues.apache.org/jira/browse/TIKA-2841
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: truncated_10000.zip, truncated_30000.zip
>
>
> We've done some work on this with docx, etc, but we can do more with epub and 
> open office, and, frankly msoffice as well.  We should also improve the 
> ContainerDetector to work more robustly with truncated zips.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to