[ 
https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17324005#comment-17324005
 ] 

Maruan Sahyoun edited comment on PDFBOX-5166 at 4/16/21, 6:16 PM:
------------------------------------------------------------------

Yes, there is: multimedia content such as sound or video, and there is 3D 
content. And there are collections. At the end of the day most of these boil 
down to streams, but I'm not sure whether you detect and extract them. 


was (Author: msahyoun):
Yes there is -  multimedia content such as sound or video and there is 3D 
content. 

> Implement RichMedia annotation
> ------------------------------
>
>                 Key: PDFBOX-5166
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5166
>             Project: PDFBox
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: testFlashInPDF.pdf
>
>
> See TIKA-3359.  The attached file has an embedded Flash/swf file.  Tika is not 
> currently extracting the embedded file.
> In the debugger, I can see the Annotation as a PDAnnotationUnknown.  In the 
> COSDictionary, I can see the subtype is "RichMedia".  If someone has the 
> time, it'd be great to implement this so that we can extract more attachments 
> in Tika...  Obv, others may find use too. :D
> Many thanks to Tyler Thorsted for the test file and many thanks to 
> @terminalboredom and @beet_keeper.
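
The subtype mentioned in the description can be checked for without a debugger. The following is only a rough sketch, not PDFBox code: it scans the raw file bytes for a plain-text "/Subtype /RichMedia" entry using the JDK alone. The class and method names (RichMediaScan, containsRichMedia) are made up for illustration. It assumes the annotation dictionary is stored uncompressed and that the name is written without #-escapes; a dictionary inside a compressed object stream would be missed, so a negative result proves nothing.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox or Tika.
public class RichMediaScan {
    // "/Subtype" followed by optional whitespace and the name "/RichMedia".
    // The lookahead rejects longer names that merely start with "RichMedia".
    private static final Pattern RICH_MEDIA =
            Pattern.compile("/Subtype\\s*/RichMedia(?![A-Za-z0-9])");

    /** Returns true if the raw bytes contain a plain-text RichMedia subtype entry. */
    public static boolean containsRichMedia(byte[] pdfBytes) {
        // ISO-8859-1 maps every byte to exactly one char, so binary
        // stream data cannot cause a decoding failure.
        String raw = new String(pdfBytes, StandardCharsets.ISO_8859_1);
        return RICH_MEDIA.matcher(raw).find();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java RichMediaScan <file.pdf>");
            return;
        }
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(containsRichMedia(bytes)
                ? "plain-text RichMedia subtype found"
                : "no plain-text RichMedia subtype found");
    }
}
```

Running it as "java RichMediaScan testFlashInPDF.pdf" would report whether the attached test file exposes the subtype in plain text. Proper detection and extraction would still need to go through the parsed annotation dictionary.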



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

