[
https://issues.apache.org/jira/browse/NUTCH-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174023#comment-13174023
]
Markus Jelsma commented on NUTCH-1230:
--------------------------------------
We previously used the byte[] as input but Tika now required java.io.File:
http://tika.apache.org/1.0/api/org/apache/tika/Tika.html#detect%28java.io.File%29
in o.a.n.Content we have the byte[] but must pass a File to
MimeUtil.autoResolveContentType(). But i have no idea how i can convert an
in-memory byte[] to a File?!
I hate getting stuck again, any advice would be more then helpful!
> MimeType utils broken with Tika 1.1
> -----------------------------------
>
> Key: NUTCH-1230
> URL: https://issues.apache.org/jira/browse/NUTCH-1230
> Project: Nutch
> Issue Type: Bug
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Fix For: 1.5
>
>
> We used Tika 1.0-SNAPSHOT in production and just switched to 1.1-SNAPSHOT.
> The new version triggers the following error:
> {code}
> 2011-12-21 12:29:56,665 ERROR http.Http - java.lang.IllegalAccessError: tried
> to access method
> org.apache.tika.mime.MimeTypes.getMimeType([B)Lorg/apache/tika/mime/MimeType;
> from class org.apache.nutch.util.MimeUtil
> 2011-12-21 12:29:56,665 ERROR http.Http - at
> org.apache.nutch.util.MimeUtil.autoResolveContentType(MimeUtil.java:169)
> 2011-12-21 12:29:56,665 ERROR http.Http - at
> org.apache.nutch.protocol.Content.getContentType(Content.java:292)
> 2011-12-21 12:29:56,666 ERROR http.Http - at
> org.apache.nutch.protocol.Content.<init>(Content.java:88)
> 2011-12-21 12:29:56,666 ERROR http.Http - at
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:142)
> 2011-12-21 12:29:56,666 ERROR http.Http - at
> org.apache.nutch.parse.ParserChecker.run(ParserChecker.java:82)
> 2011-12-21 12:29:56,666 ERROR http.Http - at
> org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> 2011-12-21 12:29:56,666 ERROR http.Http - at
> org.apache.nutch.parse.ParserChecker.main(ParserChecker.java:138)
> {code}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira