[ 
https://issues.apache.org/jira/browse/TIKA-2086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Akash Sudhakar updated TIKA-2086:
---------------------------------
    Description: 
During doc file extraction using tika 1.13, while trying to extract document 
text, metadata details also coming as output.
File extension is .doc, but tika is detecting it as application/xml.



> Metadata also getting extracted along with document text
> --------------------------------------------------------
>
>                 Key: TIKA-2086
>                 URL: https://issues.apache.org/jira/browse/TIKA-2086
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.13
>            Reporter: Akash Sudhakar
>
> During doc file extraction using tika 1.13, while trying to extract document 
> text, metadata details also coming as output.
> File extension is .doc, but tika is detecting it as application/xml.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to