Chris -

I'm not sure...on the one hand, since Tika is basically a text parsing tool,
we might want to make plain text the default MIME type.  We couldn't really
do anything with an octet stream anyway, right?  

On the other hand, we wouldn't want to attempt to parse something that does
not have text, so a nonparseable MIME type such as octet stream as default
might make more sense.

Isn't our framework supposed to determine the MIME type based on the
content?  Is there perhaps just a configuration or code change that needs to
be made?  If so, then this is not an issue.

- Keith


Chris Mattmann wrote:
> 
> Hi Keith,
> 
>  The default mime type in TIKA is application/octet-stream. It gets set
> when
> the mime type can't be determined using 3 main means (url resolution,
> extension resolution, or magic chars). This is in the MimeTypes.java file
> within the mime package. The reason no parser gets called is because there
> is no parser registered to handle that mime type.
> 
>  Are you suggesting that there is another, more sensible default?
> 
> Thanks!
> 
> Cheers,
>   Chris
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Default-MIME-Type--tf4609978.html#a13165862
Sent from the Apache Tika - Development mailing list archive at Nabble.com.

Reply via email to