[
https://issues.apache.org/jira/browse/TIKA-950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marco Quaranta updated TIKA-950:
--------------------------------
Attachment: ZipContainerDetector.diff
> Wrong Office Open XML detection in ZipContainerDetector
> -------------------------------------------------------
>
> Key: TIKA-950
> URL: https://issues.apache.org/jira/browse/TIKA-950
> Project: Tika
> Issue Type: Bug
> Components: mime
> Reporter: Marco Quaranta
> Priority: Minor
> Labels: detection, ooxml
> Fix For: 1.1
>
> Attachments: ZipContainerDetector.diff
>
>
> Method detectOfficeOpenXML() in ZipContainerDetector class does not detect
> correctly an ooxml file (find attachment) due to an Exception throwed by
> OPCPackage.open(..) POI's class. This class make use of ZipFile when its
> method is called with (as Tika do) filePath string and in this way it
> generates an exception; passing instead a fileInputStream makes POI correctly
> detects OfficeOpenXML formats.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira