[ 
https://issues.apache.org/jira/browse/TIKA-2982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16972573#comment-16972573
 ] 

Tim Allison commented on TIKA-2982:
-----------------------------------

[~Mr_Jiang], are you able to open these files?  Did you create them with 
MSOffice or another tool?

I can't tell from a quick reading of the spec 
(http://msdn.microsoft.com/en-us/library/cc313071.aspx) if the DataSpaces 
entries are required.

>From the git history, the first "else if" was added as part of TIKA-791, and I 
>suspect it was meant to replace the second else if (that just returned OLE). 

Any objections to removing the requirement for DataSpaces (obv. pending info 
from [~Mr_Jiang])?

> Tika 识别已加密的xlsx、docx、pptx时会把它们错误地识别成doc
> ---------------------------------------
>
>                 Key: TIKA-2982
>                 URL: https://issues.apache.org/jira/browse/TIKA-2982
>             Project: Tika
>          Issue Type: Bug
>          Components: detector
>    Affects Versions: 1.20
>            Reporter: Feng Jiao Jiang
>            Assignee: Tim Allison
>            Priority: Blocker
>         Attachments: 1.docx, 1.xlsx, 2.pptx
>
>
> Tika 识别已加密的xlsx、docx、pptx时会把它们错误地识别成doc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to