[ 
https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965590#action_12965590
 ] 

Antoni Mylka commented on TIKA-562:
-----------------------------------

Your unit tests cover identification by name and identification by data 
separately. The problem only shows up when you identify a file using both name 
and data (quite a common case). The patch modifies five mime type definitions. 
Four of them already have examples in test-documents: testEXCEL.xlsb, 
testPPT.pptm, testPPT.potm, testPPT.pptm. Only ppam is missing. I will see what 
I can do.
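
To make the failure mode concrete, here is a minimal, self-contained sketch of 
the combination rule described in the issue. It is not Tika's actual code: the 
class CombinedDetectionSketch, its methods, and the xlsb type string are 
hypothetical names used only for illustration. It assumes the name-based type 
is returned only when it is the data-based type or one of its descendants.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model of name+data detection; NOT the Tika implementation.
public class CombinedDetectionSketch {
    // child type -> parent type, standing in for the parent
    // declarations in tika-mimetypes.xml
    private final Map<String, String> parents = new HashMap<>();

    void setParent(String child, String parent) {
        parents.put(child, parent);
    }

    // True if 'ancestor' is reachable from 'type' by following parents.
    boolean isSpecializationOf(String type, String ancestor) {
        String current = parents.get(type);
        while (current != null) {
            if (current.equals(ancestor)) {
                return true;
            }
            current = parents.get(current);
        }
        return false;
    }

    // Assumed rule: prefer the name-based type only if it refines
    // (or equals) the data-based type; otherwise keep the data-based type.
    String detect(String byData, String byName) {
        if (byName != null
                && (byName.equals(byData) || isSpecializationOf(byName, byData))) {
            return byName;
        }
        return byData;
    }

    public static void main(String[] args) {
        String ooxml = "application/x-tika-ooxml";
        String msoffice = "application/x-tika-msoffice";
        // Illustrative type name for .xlsb; an assumption, not taken from the issue.
        String xlsb = "application/vnd.ms-excel.sheet.binary.macroenabled.12";

        CombinedDetectionSketch wrong = new CombinedDetectionSketch();
        wrong.setParent(xlsb, msoffice);   // parent as before the patch
        System.out.println(wrong.detect(ooxml, xlsb));

        CombinedDetectionSketch fixed = new CombinedDetectionSketch();
        fixed.setParent(xlsb, ooxml);      // parent as proposed in the patch
        System.out.println(fixed.detect(ooxml, xlsb));
    }
}
```

With the wrong parent the sketch falls back to the generic x-tika-ooxml type 
even though the name-based guess is correct. With the corrected parent the 
specific type is returned.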

> In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent
> ----------------------------------------------------------------------------
>
>                 Key: TIKA-562
>                 URL: https://issues.apache.org/jira/browse/TIKA-562
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Antoni Mylka
>         Attachments: ooxml-children.patch
>
>
> A couple of file types have application/x-tika-msoffice as their parent when 
> they should have application/x-tika-ooxml. The error shows up when you try 
> to identify those files with both name and data. The data is detected as 
> x-tika-ooxml, and the type found from the name is correct, but because it is 
> not a subtype of x-tika-ooxml, it is not returned.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
