[
https://issues.apache.org/jira/browse/TIKA-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-2026.
-------------------------------
Resolution: Fixed
Assignee: Tim Allison
Fix Version/s: 1.14
2.0
> Handle OLE 2.0 embedded non-Office document in PPT/X and XLSX
> -------------------------------------------------------------
>
> Key: TIKA-2026
> URL: https://issues.apache.org/jira/browse/TIKA-2026
> Project: Tika
> Issue Type: Bug
> Reporter: Tim Allison
> Assignee: Tim Allison
> Fix For: 2.0, 1.14
>
> Attachments: oleObject1.bin, testEmbedded3.pptx
>
>
> When some files (e.g. pdfs) are embedded in XLSX, PPT and PPTX, they are
> wrapped in an OLE compobj. In TIKA-704, we added handling for these types of
> embedded files in DOC/DOCX files. We need to make a few modifications to
> extract these in XLSX, PPT and PPTX.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)