[ 
https://issues.apache.org/jira/browse/NUTCH-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174023#comment-13174023
 ] 

Markus Jelsma commented on NUTCH-1230:
--------------------------------------

We previously used the byte[] as input, but Tika now requires a java.io.File:
http://tika.apache.org/1.0/api/org/apache/tika/Tika.html#detect%28java.io.File%29

In o.a.n.Content we have the byte[], but we must pass a File to 
MimeUtil.autoResolveContentType(). I have no idea how I can convert an 
in-memory byte[] to a File?! 
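
For reference, a minimal sketch of two ways to hand an in-memory byte[] to an
API that does not accept one. It uses only the JDK; the class and method names
(BytesToFile, toTempFile, toStream) are hypothetical and not part of Nutch or
Tika. The first helper spills the bytes to a temporary file, which satisfies a
File-only signature at the cost of disk I/O. The second wraps the bytes in an
InputStream, which would be the cheaper route if the Tika version in use offers
a stream-based or byte[]-based detect overload (an assumption to check against
its Javadoc).

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

// Hypothetical helper class, for illustration only.
class BytesToFile {

    // Writes the bytes to a temporary file and returns it.
    // The file is registered for deletion when the JVM exits.
    static File toTempFile(byte[] data) throws IOException {
        File tmp = File.createTempFile("content-", ".bin");
        tmp.deleteOnExit();
        Files.write(tmp.toPath(), data);
        return tmp;
    }

    // Wraps the bytes in an in-memory stream; nothing touches the disk.
    static InputStream toStream(byte[] data) {
        return new ByteArrayInputStream(data);
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "<html><body>hello</body></html>"
                .getBytes(StandardCharsets.UTF_8);

        File file = toTempFile(data);
        System.out.println("temp file: " + file + " (" + file.length() + " bytes)");

        try (InputStream in = toStream(data)) {
            System.out.println("first byte from stream: " + (char) in.read());
        }
    }
}
```

Writing a temporary file for every fetched document is likely to be slow in a
crawler, so the stream variant is probably preferable wherever the receiving
API allows it.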

I hate getting stuck again; any advice would be more than helpful!
                
> MimeType utils broken with Tika 1.1
> -----------------------------------
>
>                 Key: NUTCH-1230
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1230
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.5
>
>
> We used Tika 1.0-SNAPSHOT in production and just switched to 1.1-SNAPSHOT. 
> The new version triggers the following error:
> {code}
> 2011-12-21 12:29:56,665 ERROR http.Http - java.lang.IllegalAccessError: tried 
> to access method 
> org.apache.tika.mime.MimeTypes.getMimeType([B)Lorg/apache/tika/mime/MimeType; 
> from class org.apache.nutch.util.MimeUtil
> 2011-12-21 12:29:56,665 ERROR http.Http - at 
> org.apache.nutch.util.MimeUtil.autoResolveContentType(MimeUtil.java:169)
> 2011-12-21 12:29:56,665 ERROR http.Http - at 
> org.apache.nutch.protocol.Content.getContentType(Content.java:292)
> 2011-12-21 12:29:56,666 ERROR http.Http - at 
> org.apache.nutch.protocol.Content.<init>(Content.java:88)
> 2011-12-21 12:29:56,666 ERROR http.Http - at 
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:142)
> 2011-12-21 12:29:56,666 ERROR http.Http - at 
> org.apache.nutch.parse.ParserChecker.run(ParserChecker.java:82)
> 2011-12-21 12:29:56,666 ERROR http.Http - at 
> org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> 2011-12-21 12:29:56,666 ERROR http.Http - at 
> org.apache.nutch.parse.ParserChecker.main(ParserChecker.java:138)
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
