feng ye created TIKA-2735:
-----------------------------

             Summary: Notes and footer contents are duplicated when extracting
text from PowerPoint slides
                 Key: TIKA-2735
                 URL: https://issues.apache.org/jira/browse/TIKA-2735
             Project: Tika
          Issue Type: Bug
          Components: handler
    Affects Versions: 1.18
            Reporter: feng ye
         Attachments: Oneslide.ppt, pptTextResults.txt

Notes and footer contents are duplicated at the end of the output when
extracting text from PPT slides (such as the one in the attachment). Both the
input file and the text results are attached.

Is there a configuration option that can be used to suppress this kind of 
duplication?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
