[ 
https://issues.apache.org/jira/browse/TIKA-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chris A. Mattmann resolved TIKA-1882.
-------------------------------------
    Resolution: Fixed

- applied updated MIME per [~gagravarr] comments and [[email protected]] work. 
Thanks!

{noformat}
LMC-053601:tika1.13 mattmann$ git push -u origin master
Counting objects: 14, done.
Delta compression using up to 8 threads.
Compressing objects: 100% (10/10), done.
Writing objects: 100% (14/14), 1.26 KiB | 0 bytes/s, done.
Total 14 (delta 5), reused 0 (delta 0)
remote: tika git commit: Record change for TIKA-1882 this closes #82.
remote: tika git commit: Fix for TIKA-1882: .cab, .xar, .mobi and .mov files 
from the TREC-DD-Polar dataset. This closes #82.
To https://git-wip-us.apache.org/repos/asf/tika.git
   f61a4ed..3d59471  master -> master
Branch master set up to track remote branch master from origin.
LMC-053601:tika1.13 mattmann$ 
{noformat}


> Scientific MIME updates to .cab files, .xar and .mobi and .mov files based on 
> TREC-DD-Polar analysis
> ----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1882
>                 URL: https://issues.apache.org/jira/browse/TIKA-1882
>             Project: Tika
>          Issue Type: Sub-task
>          Components: mime
>    Affects Versions: 1.11
>            Reporter: Manisha Kampasi
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>              Labels: memex, nsfpolar, patch
>             Fix For: 1.13
>
>
> The following mime magic can be added to better detect the below mime-types:
> 1. vnd.ms-cab-compressed (.cab files) - pattern "MCSF" in the first 4 bytes
> 2. application/vnd.xara (.xar files) - pattern "xar!" in the first 4 bytes
> 3. application/x-mobipocket-ebook (.mobi files) - pattern "BOOKMOBI" starting 
> at byte position 60
> 4. video/quicktime (.mov files) - patterns "free" and "wide" seen starting at 
> byte position 4
> The changes can be seen here:
> https://github.com/mkampasi/tika/commit/f7433daf434a44937ba3ae8b15813a768f95e334



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to