[
https://issues.apache.org/jira/browse/TIKA-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342379#comment-14342379
]
Hudson commented on TIKA-1563:
------------------------------
SUCCESS: Integrated in tika-trunk-jdk1.7 #521 (See
[https://builds.apache.org/job/tika-trunk-jdk1.7/521/])
TIKA-1563 Put the more common gzip file extension (.gz) first in the glob list
(nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1663138)
*
/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
> Use .gz as the default extension for application/gzip
> -----------------------------------------------------
>
> Key: TIKA-1563
> URL: https://issues.apache.org/jira/browse/TIKA-1563
> Project: Tika
> Issue Type: Bug
> Components: core
> Reporter: Adam Lamar
> Priority: Minor
> Fix For: 1.8
>
>
> This ticket is in reference to the following code:
> MimeType mimeType = config.getMimeRepository().forName("application/gzip");
> String extension = mimeType.getExtension();
> The string extension will be ".tgz". While it is possible to get .gz or other
> common extensions with mimeType.getExtensions(), it would be nice if the
> default was .gz. Every .tgz is an application/gzip, but every
> application/gzip is not a .tgz.
> Note that although the code above uses the static mime type application/gzip,
> it is more likely that real-world code is performing mime type detection on
> specific files and looking up the returned type in the mime registry.
> Returning .gz for a gzipped tarball isn't wrong, but is less specific than
> its real type. However, returning .tgz for gzipped text is incorrect.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)