[
https://issues.apache.org/jira/browse/NUTCH-1045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13065806#comment-13065806
]
Julien Nioche commented on NUTCH-1045:
--------------------------------------
Does not pass the tests - will investigate later
{quote}
Testsuite: org.apache.nutch.protocol.TestContent
Tests run: 2, Failures: 1, Errors: 0, Time elapsed: 0.127 sec
------------- Standard Output ---------------
2011-07-15 10:16:34,767 INFO conf.Configuration
(Configuration.java:getConfResourceAsInputStream(941)) - tika-mimetypes.xml not
found
2011-07-15 10:16:34,784 ERROR util.MimeUtil (MimeUtil.java:<init>(71)) - Can't
load mime.types.file : tika-mimetypes.xml using Tika's default
------------- ---------------- ---------------
Testcase: testContent took 0.109 sec
Testcase: testGetContentType took 0.005 sec
FAILED
null expected:<[text/html]> but was:<[application/octet-stream]>
junit.framework.ComparisonFailure: null expected:<[text/html]> but
was:<[application/octet-stream]>
at
org.apache.nutch.protocol.TestContent.testGetContentType(TestContent.java:72)
{quote}
> MimeUtil to rely on default config provided by Tika
> ---------------------------------------------------
>
> Key: NUTCH-1045
> URL: https://issues.apache.org/jira/browse/NUTCH-1045
> Project: Nutch
> Issue Type: Improvement
> Affects Versions: 1.4, 2.0
> Reporter: Julien Nioche
> Priority: Minor
> Fix For: 1.4, 2.0
>
> Attachments: NUTCH-1045-1.4.patch
>
>
> We currently provide conf/tika-mimetypes.xml despite the fact that it is
> absolutely similar to the one found in tika-core.jar
> Having a mechanism for specifying a custom tika-mimetypes.xml is good though
> but if the user hasn't specified one or if it can't be loaded then we should
> rely on Tika's default. This way we won't need to provide
> conf/tika-mimetypes.xml anymore and keep it in sync with the default Tika one
> whenever we upgrade Tika.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira