[
https://issues.apache.org/jira/browse/TIKA-950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marco Quaranta updated TIKA-950:
--------------------------------
Description: Method detectOfficeOpenXML() in ZipContainerDetector class
does not detect correctly an ooxml file due to an Exception throwed by
OPCPackage.open(..) POI's class. This class make use of ZipFile when its method
is called with (as Tika do) filePath string and in this way it generates an
exception; passing instead a fileInputStream makes POI correctly detects
OfficeOpenXML formats. (was: Method detectOfficeOpenXML() in
ZipContainerDetector class does not detect correctly an ooxml file (find
attachment) due to an Exception throwed by OPCPackage.open(..) POI's class.
This class make use of ZipFile when its method is called with (as Tika do)
filePath string and in this way it generates an exception; passing instead a
fileInputStream makes POI correctly detects OfficeOpenXML formats.)
> Wrong Office Open XML detection in ZipContainerDetector
> -------------------------------------------------------
>
> Key: TIKA-950
> URL: https://issues.apache.org/jira/browse/TIKA-950
> Project: Tika
> Issue Type: Bug
> Components: mime
> Reporter: Marco Quaranta
> Priority: Minor
> Labels: detection, ooxml
> Fix For: 1.1
>
> Attachments: ZipContainerDetector.diff
>
>
> Method detectOfficeOpenXML() in ZipContainerDetector class does not detect
> correctly an ooxml file due to an Exception throwed by OPCPackage.open(..)
> POI's class. This class make use of ZipFile when its method is called with
> (as Tika do) filePath string and in this way it generates an exception;
> passing instead a fileInputStream makes POI correctly detects OfficeOpenXML
> formats.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira