[ 
https://issues.apache.org/jira/browse/TIKA-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14342379#comment-14342379
 ] 

Hudson commented on TIKA-1563:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #521 (See 
[https://builds.apache.org/job/tika-trunk-jdk1.7/521/])
TIKA-1563 Put the more common gzip file extension (.gz) first in the glob list 
(nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1663138)
* 
/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml


> Use .gz as the default extension for application/gzip
> -----------------------------------------------------
>
>                 Key: TIKA-1563
>                 URL: https://issues.apache.org/jira/browse/TIKA-1563
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>            Reporter: Adam Lamar
>            Priority: Minor
>             Fix For: 1.8
>
>
> This ticket is in reference to the following code:
> MimeType mimeType = config.getMimeRepository().forName("application/gzip"); 
> String extension = mimeType.getExtension();
> The string extension will be ".tgz". While it is possible to get .gz or other 
> common extensions with mimeType.getExtensions(), it would be nice if the 
> default was .gz. Every .tgz is an application/gzip, but every 
> application/gzip is not a .tgz. 
> Note that although the code above uses the static mime type application/gzip, 
> it is more likely that real-world code is performing mime type detection on 
> specific files and looking up the returned type in the mime registry. 
> Returning .gz for a gzipped tarball isn't wrong, but is less specific than 
> its real type. However, returning .tgz for gzipped text is incorrect.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to