[
https://issues.apache.org/jira/browse/TIKA-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13946671#comment-13946671
]
Nick Burch commented on TIKA-1264:
----------------------------------
Do you know what container format (if any) PST is based on?
Only the magic doesn't look like a normal OLE2 signature, but the mime
definition you've posted says it is based on it (from the subclass)
> Improve PST file detection
> --------------------------
>
> Key: TIKA-1264
> URL: https://issues.apache.org/jira/browse/TIKA-1264
> Project: Tika
> Issue Type: Improvement
> Components: detector
> Affects Versions: 1.5
> Reporter: Luis Filipe Nassif
> Priority: Trivial
>
> Please update the PST mime-type definition to:
> {code}
> <mime-type type="application/vnd.ms-outlook-pst">
> <_comment>Outlook Personal Folders File Format</_comment>
> <magic priority="50">
> <match value="!BDN....SM" type="string" offset="0"
> mask="0xFFFFFFFF00000000FFFF"/>
> </magic>
> <glob pattern="*.pst"/>
> <glob pattern="*.ost"/>
> <sub-class-of type="application/x-tika-msoffice"/>
> </mime-type>
> {code}
> File header description:
> http://msdn.microsoft.com/en-us/library/ff387474(v=office.12).aspx
--
This message was sent by Atlassian JIRA
(v6.2#6252)