[ 
https://issues.apache.org/jira/browse/TIKA-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13946671#comment-13946671
 ] 

Nick Burch commented on TIKA-1264:
----------------------------------

Do you know what container format (if any) PST is based on?

Only the magic doesn't look like a normal OLE2 signature, but the mime 
definition you've posted says it is based on it (from the subclass)

> Improve PST file detection
> --------------------------
>
>                 Key: TIKA-1264
>                 URL: https://issues.apache.org/jira/browse/TIKA-1264
>             Project: Tika
>          Issue Type: Improvement
>          Components: detector
>    Affects Versions: 1.5
>            Reporter: Luis Filipe Nassif
>            Priority: Trivial
>
> Please update the PST mime-type definition to:
> {code}
> <mime-type type="application/vnd.ms-outlook-pst">    
>       <_comment>Outlook Personal Folders File Format</_comment>
>       <magic priority="50">
>               <match value="!BDN....SM" type="string" offset="0" 
> mask="0xFFFFFFFF00000000FFFF"/>
>       </magic> 
>       <glob pattern="*.pst"/>
>       <glob pattern="*.ost"/>
>       <sub-class-of type="application/x-tika-msoffice"/>      
> </mime-type>
> {code}
> File header description: 
> http://msdn.microsoft.com/en-us/library/ff387474(v=office.12).aspx



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to