[
https://issues.apache.org/jira/browse/TIKA-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14188555#comment-14188555
]
Nick Burch commented on TIKA-1461:
----------------------------------
I've just tried with a recent snapshot build, and both with and without the
filename the Tika app is able to correctly detect the type:
$ java -jar tika-app-1.7-SNAPSHOT.jar --detect
/tmp/support-healthcheck-plugin-1.0.3.jar
application/java-archive
$ java -jar tika-app-1.7-SNAPSHOT.jar --detect <
/tmp/support-healthcheck-plugin-1.0.3.jar
application/java-archive
Any chance you could retest with a recent nightly build / build from svn trunk,
and see if we've already solved this?
> Bad mime detection of certain JAR file
> --------------------------------------
>
> Key: TIKA-1461
> URL: https://issues.apache.org/jira/browse/TIKA-1461
> Project: Tika
> Issue Type: Bug
> Components: core
> Affects Versions: 1.6
> Reporter: Cservenak, Tamas
>
> Given this "ordinary" Java JAR file
> https://maven.atlassian.com/content/groups/public/com/atlassian/support/healthcheck/support-healthcheck-plugin/1.0.3/support-healthcheck-plugin-1.0.3.jar
> Manually inspected and tested it, it is a Jar file and is valid one.
> Still, Tika Core's Detector detects it as type {{application/x-msdownload;
> format=pe}}. Tthe detection is "hinted" with file name, hence "jar" hint is
> present, still it's not detected as desired {{application/java-archive}}.
> IMO, this happens due to the problem with priority of
> {{application/x-msdownload; format=pe}}, which is 55. If it would be 50, the
> "mediation" would kick in, see TIKA-1292.
> Changing/overriding magic priority is not possible using
> {{custom-mimetypes.xml}} is also not possible.
> Unsure what the correct solution is here, nor how to circumvent this without
> patching Tika.
> The problem affects versions 1.5 but also 1.6, but we target 1.6.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)