[ https://issues.apache.org/jira/browse/TIKA-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16042834#comment-16042834 ]
Oytun Tez commented on TIKA-1945:
---------------------------------
I believe this may be because XSLFRelation in Apache POI does not provide a
relation for the `./ppt/diagram` directory.
If one of the Tika or POI developers can give us clues on the fastest way to
solve this, that would be fantastic, as we are not familiar with the code base! :)
There is a similar issue in the POI bug database:
https://bz.apache.org/bugzilla/show_bug.cgi?id=57596
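
To check whether the diagram text is present in the file at all, independent of
XSLFRelation, the .pptx can be read as a plain ZIP archive. The following is a
minimal sketch only, not Tika or POI code: it uses just the JDK, the names
DiagramTextSketch and extractDiagramText are hypothetical, and the
"ppt/diagram" prefix is an assumption based on the directory named above.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.FileInputStream;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.XMLConstants;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

// Hypothetical helper: not part of Tika or POI.
class DiagramTextSketch {
    // DrawingML main namespace; text runs are stored in <a:t> elements.
    static final String DRAWINGML_NS =
            "http://schemas.openxmlformats.org/drawingml/2006/main";

    // Assumption: diagram parts live under a path starting with this prefix.
    static final String DIAGRAM_PREFIX = "ppt/diagram";

    static List<String> extractDiagramText(InputStream pptx) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);

        List<String> runs = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(pptx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                if (!name.startsWith(DIAGRAM_PREFIX) || !name.endsWith(".xml")) {
                    continue;
                }
                // Buffer the entry so the XML parser cannot close the ZIP stream.
                ByteArrayOutputStream buffer = new ByteArrayOutputStream();
                byte[] chunk = new byte[8192];
                int n;
                while ((n = zip.read(chunk)) != -1) {
                    buffer.write(chunk, 0, n);
                }
                Document doc = factory.newDocumentBuilder()
                        .parse(new ByteArrayInputStream(buffer.toByteArray()));
                // Collect every non-empty <a:t> text run in document order.
                NodeList nodes = doc.getElementsByTagNameNS(DRAWINGML_NS, "t");
                for (int i = 0; i < nodes.getLength(); i++) {
                    String text = nodes.item(i).getTextContent();
                    if (text != null && !text.isEmpty()) {
                        runs.add(text);
                    }
                }
            }
        }
        return runs;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java DiagramTextSketch <file.pptx>");
            return;
        }
        try (InputStream in = new FileInputStream(args[0])) {
            for (String run : extractDiagramText(in)) {
                System.out.println(run);
            }
        }
    }
}
```

Running this against the attached Diagram.pptx should show whether the org
chart labels are stored in those parts. The same text may be printed more than
once if several diagram parts repeat it, and the sketch ignores the package
relationships entirely, so it is a diagnostic aid rather than a fix.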
> Powerpoint parser doesn't extract text from diagrams
> ----------------------------------------------------
>
> Key: TIKA-1945
> URL: https://issues.apache.org/jira/browse/TIKA-1945
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.12
> Reporter: Nick C
> Attachments: Diagram.pptx
>
>
> Attached is an example org chart that Tika doesn't extract text from