Peter Davies created TIKA-2524:
----------------------------------
Summary: Apache Tika returns empty string when parsing text from
XPS files
Key: TIKA-2524
URL: https://issues.apache.org/jira/browse/TIKA-2524
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.16
Reporter: Peter Davies
When we parse XPS files using the AutoParser we always get an empty string.
If we use DefaultDetector.detect() it correctly detects the MediaType as
"application/vnd.ms-xpsdocument".
This page
https://tika.apache.org/1.16/formats.html
suggests that XPS (application/vnd.ms-xpsdocument) is supported however.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)