[
https://issues.apache.org/jira/browse/TIKA-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16042794#comment-16042794
]
Oytun Tez edited comment on TIKA-1945 at 6/8/17 2:44 PM:
---------------------------------------------------------
This is a confirmed issue for 1.15 as well. `./ppt/diagrams/*.xml` files are
not processed. If there is a quick work around this, we would like to do it.
This is currently a production issue for us.
was (Author: oytun):
This is a confirmed issue. `./ppt/diagrams/*.xml` files are not processed. If
there is a quick work around this, we would like to do it. This is currently a
production issue for us.
> Powerpoint parser doesn't extract text from diagrams
> ----------------------------------------------------
>
> Key: TIKA-1945
> URL: https://issues.apache.org/jira/browse/TIKA-1945
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.12
> Reporter: Nick C
> Attachments: Diagram.pptx
>
>
> Attached is an example org chart that Tika doesn't extract text from
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)