feng ye created TIKA-2735:
-----------------------------
Summary: notes and footer contents are duplicated in extracting
text from power point slides
Key: TIKA-2735
URL: https://issues.apache.org/jira/browse/TIKA-2735
Project: Tika
Issue Type: Bug
Components: handler
Affects Versions: 1.18
Reporter: feng ye
Attachments: Oneslide.ppt, pptTextResults.txt
notes and footer contents are duplicated at the end when extract text from ppt
slides (like the one in the attachment). Both the input file and the text
results are attached.
Is there a configuration option that can be used to suppress this kind of
duplication?
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)