[
https://issues.apache.org/jira/browse/TIKA-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043325#comment-16043325
]
Tim Allison commented on TIKA-1945:
-----------------------------------
Working version of patch handles .docx/pptx/xlsx/xlsb. It does not handle
.doc/ppt/xls. Will commit tonight or tomorrow.
> Powerpoint parser doesn't extract text from diagrams
> ----------------------------------------------------
>
> Key: TIKA-1945
> URL: https://issues.apache.org/jira/browse/TIKA-1945
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.12
> Reporter: Nick C
> Assignee: Tim Allison
> Attachments: Diagram.pptx, TIKA-1945.docx, TIKA-1945.pptx
>
>
> Attached is an example org chart that Tika doesn't extract text from
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)