Hey,

I need to get the iana.org media type, rather than application/zip or
application/x-tika-msoffice, for documents such as odt, ppt, pptx, xlsx, etc.,
when doing:

MediaType mediaType = MediaType.parse(tika.detect(is));


If you look at mimetypes.xml, there are mime-type elements composed of the
iana.org MIME type, an alias and a "sub-class-of":

  <mime-type type="application/msword">
    <alias type="application/vnd.ms-word"/>
    ............................
    <glob pattern="*.doc"/>
    <glob pattern="*.dot"/>
    <sub-class-of type="application/x-tika-msoffice"/>
  </mime-type>
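
To make clear how I am reading that entry, here is a minimal sketch that
pulls the three pieces apart with the JDK's own XML parser. It does not use
Tika at all; the class MimeEntrySketch, the Entry holder and the parse method
are hypothetical names made up for this illustration, and the elided part of
the entry is left out of the sample string.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class MimeEntrySketch {

    /** Hypothetical holder for one mime-type element; not a Tika class. */
    static final class Entry {
        final String type;                              // the type attribute of mime-type
        final List<String> aliases = new ArrayList<>(); // alias elements
        final List<String> globs = new ArrayList<>();   // glob patterns
        String superType;                               // sub-class-of, or null if absent

        Entry(String type) {
            this.type = type;
        }
    }

    /** Parses a single mime-type element given as a string. */
    static Entry parse(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            Element root = doc.getDocumentElement();
            Entry entry = new Entry(root.getAttribute("type"));

            NodeList aliasNodes = root.getElementsByTagName("alias");
            for (int i = 0; i < aliasNodes.getLength(); i++) {
                entry.aliases.add(((Element) aliasNodes.item(i)).getAttribute("type"));
            }

            NodeList globNodes = root.getElementsByTagName("glob");
            for (int i = 0; i < globNodes.getLength(); i++) {
                entry.globs.add(((Element) globNodes.item(i)).getAttribute("pattern"));
            }

            NodeList superNodes = root.getElementsByTagName("sub-class-of");
            if (superNodes.getLength() > 0) {
                entry.superType = ((Element) superNodes.item(0)).getAttribute("type");
            }
            return entry;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse mime-type entry", e);
        }
    }

    public static void main(String[] args) {
        String xml =
                "<mime-type type=\"application/msword\">"
              + "<alias type=\"application/vnd.ms-word\"/>"
              + "<glob pattern=\"*.doc\"/>"
              + "<glob pattern=\"*.dot\"/>"
              + "<sub-class-of type=\"application/x-tika-msoffice\"/>"
              + "</mime-type>";

        Entry entry = parse(xml);
        System.out.println("type:         " + entry.type);
        System.out.println("aliases:      " + entry.aliases);
        System.out.println("globs:        " + entry.globs);
        System.out.println("sub-class-of: " + entry.superType);
    }
}
```

So the name I am after is the one in the type attribute, while detect()
currently hands me the one from sub-class-of.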


What is the alias for? And how can I get the iana.org MIME type name instead
of the sub-class-of type name?


Best regards, Jakub
