[
https://issues.apache.org/jira/browse/TIKA-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch resolved TIKA-1892.
------------------------------
Resolution: Fixed
Fix Version/s: 1.13
Thanks, SHP added and MOBI updated in 74e71eb
> Mime Magic for application/x-mobipocket-ebook and application/x-shapefile
> -------------------------------------------------------------------------
>
> Key: TIKA-1892
> URL: https://issues.apache.org/jira/browse/TIKA-1892
> Project: Tika
> Issue Type: Improvement
> Components: mime
> Affects Versions: 1.12
> Reporter: Suman Kashyap
> Priority: Minor
> Fix For: 1.13
>
>
> Our FHT analysis of mobipocket-ebook files and shapefiles shows a high
> correlation in the initial header bytes. Further inspection of such files,
> taken from data sets available online and from the TREC polar data sets,
> revealed common bytes that can be used for MIME identification.
> Patch content:
> <mime-type type="application/x-netcdf">
>   <acronym>NETCDF</acronym>
>   <_comment>Network Common Data Format</_comment>
>   <magic priority="60">
>     <match value="CDF" type="string" offset="0" />
>   </magic>
>   <glob pattern="*.nc"/>
> </mime-type>
> <mime-type type="application/x-mobipocket-ebook">
>   <acronym>MOBI</acronym>
>   <_comment>Mobipocket Ebook</_comment>
>   <magic priority="60">
>     <match value="BOOKMOBI" type="string" offset="23" />
>   </magic>
>   <glob pattern="*.mobi"/>
> </mime-type>
> <mime-type type="application/x-shapefile">
>   <acronym>ESRI Shapefiles</acronym>
>   <_comment>ESRI Shapefiles</_comment>
>   <magic priority="60">
>     <match value="0x0000270a" type="big32" offset="2" />
>   </magic>
>   <glob pattern="*.shp"/>
> </mime-type>
>
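For readers unfamiliar with how a magic rule is evaluated, the following is a minimal sketch of offset-based byte matching in Python. It is not Tika's implementation: the names `MagicRule`, `RULES` and `detect` are hypothetical, priorities and glob patterns are ignored, and the offsets and values are copied verbatim from the patch quoted above without being checked against the format specifications or against what was finally committed.

```python
import struct
from typing import NamedTuple, Optional


class MagicRule(NamedTuple):
    """One <match> entry: expect `pattern` at byte `offset` of the file."""
    mime_type: str
    offset: int
    pattern: bytes


# Transcribed from the quoted patch; offsets are as proposed there, unverified.
RULES = [
    MagicRule("application/x-netcdf", 0, b"CDF"),
    MagicRule("application/x-mobipocket-ebook", 23, b"BOOKMOBI"),
    # type="big32" means a 32-bit big-endian integer, hence ">I".
    MagicRule("application/x-shapefile", 2, struct.pack(">I", 0x0000270A)),
]


def detect(data: bytes) -> Optional[str]:
    """Return the MIME type of the first rule whose bytes match, else None."""
    for rule in RULES:
        end = rule.offset + len(rule.pattern)
        # Slicing past the end yields a shorter slice, so short inputs never match.
        if data[rule.offset:end] == rule.pattern:
            return rule.mime_type
    return None
```

Calling `detect` on the first few dozen bytes of a file is enough for these three rules, since the deepest match ends at byte 31.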
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)