[ 
https://issues.apache.org/jira/browse/TIKA-790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Burch resolved TIKA-790.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.1
    
> Reduce duplication between POIFSDocumentType (in OfficeParser) and 
> POIFSContainerDetector
> -----------------------------------------------------------------------------------------
>
>                 Key: TIKA-790
>                 URL: https://issues.apache.org/jira/browse/TIKA-790
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>             Fix For: 1.1
>
>
> For historical reasons, we now have two parts of Tika that handle trying to 
> identify the type of an OLE2 based file.
> POIFSDocumentType is able to detect a few kinds of files that 
> POIFSContainerDetector is not able to (eg Encrypted and OLE Native), mostly 
> which may not map well onto mimetypes. POIFSDocumentType also lacks some of 
> the logic in the main detector, and only does the office parser supported 
> files
> We should probably try to reduce the duplication. One option is to add the 
> extra few types into the Detector some how, the other is to use the detector 
> first and do additional specific checks after

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to