[ 
https://issues.apache.org/jira/browse/TIKA-950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marco Quaranta updated TIKA-950:
--------------------------------

    Description: Method detectOfficeOpenXML() in ZipContainerDetector class 
does not detect correctly an ooxml file due to an Exception throwed by 
OPCPackage.open(..) POI's class. This class make use of ZipFile when its method 
is called with (as Tika do) filePath string and in this way it generates an 
exception; passing instead a fileInputStream makes POI correctly detects 
OfficeOpenXML formats.  (was: Method detectOfficeOpenXML() in 
ZipContainerDetector class does not detect correctly an ooxml file (find 
attachment) due to an Exception throwed by OPCPackage.open(..) POI's class. 
This class make use of ZipFile when its method is called with (as Tika do) 
filePath string and in this way it generates an exception; passing instead a 
fileInputStream makes POI correctly detects OfficeOpenXML formats.)
    
> Wrong Office Open XML detection in ZipContainerDetector
> -------------------------------------------------------
>
>                 Key: TIKA-950
>                 URL: https://issues.apache.org/jira/browse/TIKA-950
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>            Reporter: Marco Quaranta
>            Priority: Minor
>              Labels: detection, ooxml
>             Fix For: 1.1
>
>         Attachments: ZipContainerDetector.diff
>
>
> Method detectOfficeOpenXML() in ZipContainerDetector class does not detect 
> correctly an ooxml file due to an Exception throwed by OPCPackage.open(..) 
> POI's class. This class make use of ZipFile when its method is called with 
> (as Tika do) filePath string and in this way it generates an exception; 
> passing instead a fileInputStream makes POI correctly detects OfficeOpenXML 
> formats.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to