[
https://issues.apache.org/jira/browse/TIKA-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-3359:
-------------------------------
Reopening to handle searching the DOM for files a bit more robust...
> Extract swf from PDFs
> ---------------------
>
> Key: TIKA-3359
> URL: https://issues.apache.org/jira/browse/TIKA-3359
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Fix For: 2.0.0
>
>
> On twitter, @terminalboredom and Tyler Thorsted shared examples of PDF files
> with embedded flash. I ran -z on tika-app, and we're not extracting these
> files. I suspect they're in a structure we're not currently checking.
> https://twitter.com/CHLThor/status/1382888365767360513?s=20
> https://twitter.com/sonicstacey/status/1382956466332573701?s=20
> Many thanks to @beet_keeper for putting us in touch.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)