[ 
https://issues.apache.org/jira/browse/TIKA-121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567450#action_12567450
 ] 

Chris A. Mattmann commented on TIKA-121:
----------------------------------------

Hi Jukka:

Thanks for the explanation. Well, in order for Tika to be useful in Nutch in an 
out-of-the-box fashion (without having any special mime utility code in Nutch), 
we need it to have the ability to handle mime types returned from the server, 
in the form of:

<primary type>/<sub type> ; <optional additional parameters>

So, a perfect example of this would be to take the string:

"text/html; charset=UTF-8"

And then "clean" it to parse out the mime type portion, "text/html", and drop 
the optional params. I suppose one could argue that this is a web-specific 
feature and so belongs in Nutch; however, I'm not positive that this only 
occurs on the web. Thoughts?
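
To make the request concrete, here is a minimal sketch of the kind of 
cleaning described above. The class name MimeTypeCleaner and its clean 
method are hypothetical names used only for illustration; this is not the 
removed MimeType.clean implementation, nor an existing Tika or Nutch API. 
It assumes the input has the form shown above and that lower-casing the 
result is acceptable:

```java
// Hypothetical helper for illustration only; not part of Tika or Nutch.
final class MimeTypeCleaner {

    private MimeTypeCleaner() {
    }

    /**
     * Strips optional parameters from a Content-Type style string,
     * e.g. "text/html; charset=UTF-8" becomes "text/html".
     * Returns null if the input is null or is not of the form
     * "primary type/sub type".
     */
    static String clean(String contentType) {
        if (contentType == null) {
            return null;
        }
        // Everything after the first ';' is an optional parameter list.
        int semicolon = contentType.indexOf(';');
        String base = semicolon >= 0
                ? contentType.substring(0, semicolon)
                : contentType;
        // Assumption: normalizing to lower case is acceptable here.
        base = base.trim().toLowerCase(java.util.Locale.ROOT);
        // Require a non-empty primary type and a non-empty sub type.
        int slash = base.indexOf('/');
        if (slash <= 0 || slash == base.length() - 1) {
            return null;
        }
        return base;
    }
}
```

With this sketch, MimeTypeCleaner.clean("text/html; charset=UTF-8") returns 
"text/html", and a string with no "/" in it returns null.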

Cheers,
 Chris


> MimeType.clean method no longer exists as a capability
> ------------------------------------------------------
>
>                 Key: TIKA-121
>                 URL: https://issues.apache.org/jira/browse/TIKA-121
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 0.1-incubating
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 0.2-incubating
>
>
> For some reason, in r591743 
> (http://svn.apache.org/viewvc?rev=591743&view=rev), the MimeType.clean 
> functionality was removed and never replaced. This is a problem because I'm 
> trying to upgrade Nutch to tika-0.1-incubating, and Nutch relies on 
> MimeType.clean.
> I've been scratching my head trying to determine an appropriate workaround 
> for the same capability within the tika-0.1-incubating code, but have yet to 
> find one. This functionality needs to be replaced in some form or fashion, 
> or, if someone knows of a simple way to achieve the same functionality, 
> please let me know.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.