[
https://issues.apache.org/jira/browse/TIKA-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jonathan LI resolved TIKA-684.
------------------------------
Resolution: Not A Problem
False Issue
> Partial/Incomplete text extraction for certain Powerpoint files
> ---------------------------------------------------------------
>
> Key: TIKA-684
> URL: https://issues.apache.org/jira/browse/TIKA-684
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.9
> Reporter: Jonathan LI
> Attachments: 2eebe3db1196aa8ea58c9be83965f0eb.ppt
>
>
> Example file with issue attached.
> Tika throws exception during text extraction of certain powerpoints. In this
> example file, the extracted text only goes up to slide 37. Text from slides
> 38-40 are missing.
> Tested via both tika library and tika GUI. Apache POI (3.8 beta 3 & 3.7)
> doesn't have any issues with text extraction of this file.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira