Tim Allison created TIKA-2026:
---------------------------------

             Summary: Handle embedded comp_obj/ oleObject.bin files stored in 
PPT/X
                 Key: TIKA-2026
                 URL: https://issues.apache.org/jira/browse/TIKA-2026
             Project: Tika
          Issue Type: Improvement
            Reporter: Tim Allison
            Priority: Minor


When some files (e.g. pdfs) are embedded in PPT and PPTX, they are wrapped in 
an OLE compobj.  It would be nice if we could extract the actual files from 
these wrappers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to