[
https://issues.apache.org/jira/browse/TIKA-2982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16972573#comment-16972573
]
Tim Allison commented on TIKA-2982:
-----------------------------------
[~Mr_Jiang], are you able to open these files? Did you create them with
MSOffice or another tool?
I can't tell from a quick reading of the spec
(http://msdn.microsoft.com/en-us/library/cc313071.aspx) if the DataSpaces
entries are required.
>From the git history, the first "else if" was added as part of TIKA-791, and I
>suspect it was meant to replace the second else if (that just returned OLE).
Any objections to removing the requirement for DataSpaces (obv. pending info
from [~Mr_Jiang])?
> Tika 识别已加密的xlsx、docx、pptx时会把它们错误地识别成doc
> ---------------------------------------
>
> Key: TIKA-2982
> URL: https://issues.apache.org/jira/browse/TIKA-2982
> Project: Tika
> Issue Type: Bug
> Components: detector
> Affects Versions: 1.20
> Reporter: Feng Jiao Jiang
> Assignee: Tim Allison
> Priority: Blocker
> Attachments: 1.docx, 1.xlsx, 2.pptx
>
>
> Tika 识别已加密的xlsx、docx、pptx时会把它们错误地识别成doc
--
This message was sent by Atlassian Jira
(v8.3.4#803005)