[ https://issues.apache.org/jira/browse/TIKA-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15509655#comment-15509655 ]
Nick Burch commented on TIKA-2086: ---------------------------------- How are you calling Apache Tika? Is this happening for all files, or just one? If just one or two, can you please post a sample file? > Metadata also getting extracted along with document text > -------------------------------------------------------- > > Key: TIKA-2086 > URL: https://issues.apache.org/jira/browse/TIKA-2086 > Project: Tika > Issue Type: Bug > Affects Versions: 1.13 > Reporter: Akash Sudhakar > > During doc file extraction using tika 1.13, while trying to extract document > text, metadata details also coming as output. > File extension is .doc, but tika is detecting it as application/xml. -- This message was sent by Atlassian JIRA (v6.3.4#6332)