[
https://issues.apache.org/jira/browse/TIKA-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15182205#comment-15182205
]
Hudson commented on TIKA-1892:
------------------------------
SUCCESS: Integrated in tika-trunk-jdk1.7 #923 (See
[https://builds.apache.org/job/tika-trunk-jdk1.7/923/])
Magic for Mobipocket Ebook and ESRI Shapefiles from TIKA-1892 from Suman (nick:
rev 74e71ebd871172c3473719d0814400f69d4c8913)
* tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
> Mime Magic for application/x-mobipocket-ebook and application/x-shapefile
> -------------------------------------------------------------------------
>
> Key: TIKA-1892
> URL: https://issues.apache.org/jira/browse/TIKA-1892
> Project: Tika
> Issue Type: Improvement
> Components: mime
> Affects Versions: 1.12
> Reporter: Suman Kashyap
> Priority: Minor
> Fix For: 1.13
>
>
> Our FHT analysis for mobipocket-ebook and shapefiles shows high corelation of
> initial header bytes. Further inspection of these files over online available
> and TREC polar data sets revealed presence of common bytes for mime
> identification
> patch content
> <mime-type type="application/x-netcdf">
> <acronym>NETCDF</acronym>
> <_comment>Network Common Data Format</_comment>
> <magic priority="60">
> <match value="CDF" type="string" offset="0" />
> </magic>
> <glob pattern="*.nc"/>
> </mime-type>
> <mime-type type="application/x-mobipocket-ebook">
> <acronym>MOBI</acronym>
> <_comment>Mobipocket Ebook</_comment>
> <magic priority="60">
> <match value="BOOKMOBI" type="string" offset="23" />
> </magic>
> <glob pattern="*.mobi"/>
> </mime-type>
> <mime-type type="application/x-shapefile">
> <acronym>ESRI Shapefiles</acronym>
> <_comment>ESRI Shapefiles</_comment>
> <magic priority="60">
> <match value="0x0000270a" type="big32" offset="2" />
> </magic>
> <glob pattern="*.shp"/>
> </mime-type>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)