[
https://issues.apache.org/jira/browse/TIKA-1945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16043724#comment-16043724
]
Hudson commented on TIKA-1945:
------------------------------
FAILURE: Integrated in Jenkins build Tika-trunk #1288 (See
[https://builds.apache.org/job/Tika-trunk/1288/])
TIKA-1945 -- extract text from diagrams in ooxml files. (tallison:
[https://github.com/apache/tika/commit/7842600560e02a4fd213d175301b4397bbe030a3])
* (add)
tika-parsers/src/test/resources/test-documents/testEXCEL_diagramData.xlsb
* (add) tika-parsers/src/test/resources/test-documents/testPPT_diagramData.pptx
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XWPFWordExtractorDecorator.java
* (edit)
tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXSLFExtractorTest.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/OOXMLExtractorFactory.java
* (add)
tika-parsers/src/test/resources/test-documents/testEXCEL_diagramData.xlsx
* (edit)
tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/AbstractOOXMLExtractor.java
* (add) tika-parsers/src/test/resources/test-documents/testWORD_diagramData.docx
* (edit) CHANGES.txt
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/SXSLFPowerPointExtractorDecorator.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSLFPowerPointExtractorDecorator.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/SXWPFWordExtractorDecorator.java
* (edit)
tika-parsers/src/test/java/org/apache/tika/parser/microsoft/ooxml/SXWPFExtractorTest.java
* (edit)
tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFBExcelExtractorDecorator.java
> Powerpoint parser doesn't extract text from diagrams
> ----------------------------------------------------
>
> Key: TIKA-1945
> URL: https://issues.apache.org/jira/browse/TIKA-1945
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.12
> Reporter: Nick C
> Assignee: Tim Allison
> Attachments: Diagram.pptx, TIKA-1945.docx, TIKA-1945.pptx
>
>
> Attached is an example org chart that Tika doesn't extract text from
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)