[
https://issues.apache.org/jira/browse/TIKA-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15509655#comment-15509655
]
Nick Burch commented on TIKA-2086:
----------------------------------
How are you calling Apache Tika? Is this happening for all files, or just one?
If just one or two, can you please post a sample file?
> Metadata also getting extracted along with document text
> --------------------------------------------------------
>
> Key: TIKA-2086
> URL: https://issues.apache.org/jira/browse/TIKA-2086
> Project: Tika
> Issue Type: Bug
> Affects Versions: 1.13
> Reporter: Akash Sudhakar
>
> During doc file extraction using tika 1.13, while trying to extract document
> text, metadata details also coming as output.
> File extension is .doc, but tika is detecting it as application/xml.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)