[ https://issues.apache.org/jira/browse/TIKA-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043571#comment-16043571 ]
Nick Burch commented on TIKA-1945: ---------------------------------- I don't know exactly what Tim'll do, but assuming it's similar to what I'd try... It'll almost certainly need some changes to both Apache POI and Apache Tika, and therefore need to wait for a POI release then a Tika 1.16 release. The patches will be open source, so you'd be most welcome to do a custom local build until then, but it wouldn't be in an official release for a few months > Powerpoint parser doesn't extract text from diagrams > ---------------------------------------------------- > > Key: TIKA-1945 > URL: https://issues.apache.org/jira/browse/TIKA-1945 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.12 > Reporter: Nick C > Assignee: Tim Allison > Attachments: Diagram.pptx, TIKA-1945.docx, TIKA-1945.pptx > > > Attached is an example org chart that Tika doesn't extract text from -- This message was sent by Atlassian JIRA (v6.3.15#6346)