[
https://issues.apache.org/jira/browse/TIKA-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371945#comment-14371945
]
Tyler Palsulich commented on TIKA-1114:
---------------------------------------
See http://en.wikipedia.org/wiki/Standard_Generalized_Markup_Language. It seems
like there isn't really a dedicated way to know whether is a file is SGML or
not...
> sgml mime type is not detected when passed in as byte stream
> ------------------------------------------------------------
>
> Key: TIKA-1114
> URL: https://issues.apache.org/jira/browse/TIKA-1114
> Project: Tika
> Issue Type: Bug
> Components: mime
> Reporter: Vikas Garg
>
> When passing sgml files as TikaInputStream (created from byte[]) to
> Detector.detect(), it returns text/plain as mediatype and not
> application/sgml or text/sgml. But when I provide the file name to metadata,
> then it gives me correct mime-type, i.e., text/sgml.
> Is it because Tika is missing any designated parser for sgml files OR am I
> missing something? I am on Tika-1.3.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)