[ https://issues.apache.org/jira/browse/TIKA-2079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison updated TIKA-2079:
------------------------------
Attachment: Root Entry_46.ttf
Root Entry_44.ttf
Root Entry_41.ttf
Root Entry_25.ttf
Root Entry_22.ttf
297135.ppt
Example file with extracted image files. {{Root Entry_22.ttf}} is likely the
image on slide 11, based on "Kapton foils" and other clear text in the file.
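
For anyone who wants to sanity-check the attachments outside of Tika, here is a
minimal sketch of a structural test for TrueType/OpenType (sfnt) files. It is
not Tika's detection logic; the function name looks_like_sfnt is hypothetical,
and the only thing assumed is the standard sfnt layout: a 4-byte version, a
2-byte table count at offset 4, and 16-byte table records starting at offset 12.
A file that merely shares a font's leading bytes should fail the table checks.

```python
import struct

# sfnt version tags that can open a TrueType/OpenType file.
SFNT_VERSIONS = {b"\x00\x01\x00\x00", b"true", b"OTTO"}


def looks_like_sfnt(data: bytes) -> bool:
    """Return True if data has a plausible sfnt table directory."""
    if len(data) < 12 or data[:4] not in SFNT_VERSIONS:
        return False
    (num_tables,) = struct.unpack(">H", data[4:6])
    if num_tables == 0 or 12 + 16 * num_tables > len(data):
        return False
    for i in range(num_tables):
        start = 12 + 16 * i
        # Each record: 4-byte tag, checksum, offset, length (all big-endian).
        tag, _checksum, offset, length = struct.unpack(
            ">4sLLL", data[start:start + 16]
        )
        if not all(0x20 <= b <= 0x7E for b in tag):
            return False  # table tags are printable ASCII
        if offset + length > len(data):
            return False  # table would run past the end of the file
    return True
```

To try it on one of the attachments, read the file in binary mode and pass the
bytes in, e.g. looks_like_sfnt(open("Root Entry_22.ttf", "rb").read()).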
> Unknown embedded image file in ppt
> ----------------------------------
>
> Key: TIKA-2079
> URL: https://issues.apache.org/jira/browse/TIKA-2079
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Trivial
> Attachments: 297135.ppt, Root Entry_22.ttf, Root Entry_25.ttf, Root
> Entry_41.ttf, Root Entry_44.ttf, Root Entry_46.ttf
>
>
> We recently modified how we extract OLE-wrapped embedded objects from ppt
> files. On a recent regression run, there were quite a few exceptions on
> embedded ttf files within ppt files. On closer examination, these aren't ttf
> files but some kind of image/drawing file.
> It seems that these occur only in older ppt files, and they're rare, so the
> priority on this issue should be "whenever..."