[
https://issues.apache.org/jira/browse/TIKA-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13873175#comment-13873175
]
Marco Quaranta commented on TIKA-1221:
--------------------------------------
I agree with you, but if you look at "application/x-tika-ooxml" in
tika-mimetypes.xml you will find a similar approach (using match and offset
inside zip files).
Anyway, are you thinking of something like what I proposed for DWFX files
(https://issues.apache.org/jira/browse/TIKA-1204)?
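
To make the match-and-offset idea concrete, here is a minimal sketch (not Tika code) of what such a rule checks. It relies on the fact that the fixed part of a ZIP local file header is 30 bytes long, so the name of the first entry starts at offset 30. The helper names build_zip and matches_proposed_magic are made up for this illustration, and the test archives are built in memory rather than taken from real XPS files.

```python
import io
import zipfile

ZIP_MAGIC = b"PK\x03\x04"            # local file header signature
XPS_MARKER = b"Metadata/Job_PT.xml"  # entry name used in the proposed definition
NAME_OFFSET = 30                     # size of the fixed part of a local file header


def build_zip(names):
    """Hypothetical helper: build an in-memory zip with entries in the given order."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_STORED) as zf:
        for name in names:
            zf.writestr(name, b"<x/>")
    return buf.getvalue()


def matches_proposed_magic(data):
    """Hypothetical helper: mimic the two nested <match> rules on raw bytes."""
    starts_like_zip = data.startswith(ZIP_MAGIC)
    name_at_offset = data[NAME_OFFSET:NAME_OFFSET + len(XPS_MARKER)] == XPS_MARKER
    return starts_like_zip and name_at_offset


marker_first = build_zip(["Metadata/Job_PT.xml", "[Content_Types].xml"])
marker_second = build_zip(["[Content_Types].xml", "Metadata/Job_PT.xml"])

print(matches_proposed_magic(marker_first))   # True
print(matches_proposed_magic(marker_second))  # False
```

The second case shows the limitation of this approach: the rule only fires when the marker entry happens to be the first one written to the archive.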
> XPS detection
> -------------
>
> Key: TIKA-1221
> URL: https://issues.apache.org/jira/browse/TIKA-1221
> Project: Tika
> Issue Type: Bug
> Components: detector, mime
> Affects Versions: 1.4
> Reporter: Marco Quaranta
>
> Tika currently detects XPS files using only the file extension.
> Please modify the XPS definition in tika-mimetypes.xml as follows:
> {noformat}
> <mime-type type="application/vnd.ms-xpsdocument">
>   <glob pattern="*.xps"/>
>   <sub-class-of type="application/zip"/>
>   <magic priority="50">
>     <match value="PK\003\004" type="string" offset="0">
>       <match value="Metadata/Job_PT.xml" type="string" offset="30"/>
>     </match>
>   </magic>
> </mime-type>
> {noformat}
> Thank you,
> Marco