[ https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965590#action_12965590 ]
Antoni Mylka commented on TIKA-562:
-----------------------------------
Your unit tests cover identification by name and identification by data, each
on its own. The problem only appears when you identify a file using both name
and data together (quite a common case). The patch modifies five MIME type
definitions. Four of them
already have their examples in test-documents. These are: testEXCEL.xlsb,
testPPT.pptm, testPPT.potm, testPPT.pptm. Only ppam is missing. Will see what I
can do.
> In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent
> ----------------------------------------------------------------------------
>
> Key: TIKA-562
> URL: https://issues.apache.org/jira/browse/TIKA-562
> Project: Tika
> Issue Type: Bug
> Reporter: Antoni Mylka
> Attachments: ooxml-children.patch
>
>
> A couple of file types have application/x-tika-msoffice as their parent when
> they should have application/x-tika-ooxml. The error shows up when you try to
> identify those files using both name and data. The data-based detection
> yields x-tika-ooxml and the name-based detection yields the correct, more
> specific type, but because that type is not declared as a subtype of
> x-tika-ooxml, it is not returned.
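
The behaviour described above can be illustrated with a small, self-contained
model. This is only a sketch of the rule as the issue describes it (prefer the
name-based type only when it specializes the data-based type); it is not Tika's
actual code. The class name, the method names and the placeholder type
application/x-example-xlsb are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

public class CombinedDetectionSketch {
    // Hypothetical, simplified registry: child type -> declared parent type.
    private final Map<String, String> parents = new HashMap<>();

    public void setParent(String child, String parent) {
        parents.put(child, parent);
    }

    // True if 'type' equals 'ancestor' or reaches it by following parent links.
    public boolean isSpecializationOf(String type, String ancestor) {
        String current = type;
        while (current != null) {
            if (current.equals(ancestor)) {
                return true;
            }
            current = parents.get(current);
        }
        return false;
    }

    // Assumed combination rule: keep the name-based type only if it is a
    // specialization of the data-based type; otherwise fall back to the data.
    public String detect(String byName, String byData) {
        if (byName != null && isSpecializationOf(byName, byData)) {
            return byName;
        }
        return byData;
    }

    public static void main(String[] args) {
        String xlsb = "application/x-example-xlsb"; // placeholder, not a real Tika type
        String ooxml = "application/x-tika-ooxml";
        String msoffice = "application/x-tika-msoffice";

        // Wrong parent: the specific type is not under x-tika-ooxml.
        CombinedDetectionSketch before = new CombinedDetectionSketch();
        before.setParent(xlsb, msoffice);
        System.out.println(before.detect(xlsb, ooxml)); // prints application/x-tika-ooxml

        // Corrected parent: the specific type is now returned.
        CombinedDetectionSketch after = new CombinedDetectionSketch();
        after.setParent(xlsb, ooxml);
        System.out.println(after.detect(xlsb, ooxml)); // prints application/x-example-xlsb
    }
}
```

In this model, changing only the declared parent is enough to make the combined
lookup return the specific type, which matches what the issue title proposes.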