[
https://issues.apache.org/jira/browse/PDFBOX-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151414#comment-14151414
]
Tilman Hausherr commented on PDFBOX-2393:
-----------------------------------------
[~sunxingzhe359] then I wonder why you didn't open an issue with TIKA...
[[email protected]] does TIKA extract PDF attachments from annotations? The
code should probably be somewhat like this:
https://mail-archives.apache.org/mod_mbox/pdfbox-users/201306.mbox/%[email protected]%3E
List annotations = page.getAnnotations();
The one you are looking for has to be an instance of the class
org.apache.pdfbox.pdmodel.interactive.annotation.PDAnnotationFileAttachment.
> PDF embeded with document can not parse.
> ----------------------------------------
>
> Key: PDFBOX-2393
> URL: https://issues.apache.org/jira/browse/PDFBOX-2393
> Project: PDFBox
> Issue Type: Improvement
> Components: Parsing
> Affects Versions: 1.8.6
> Environment: Windows7
> Reporter: sunxingzhe
> Attachments: embedded.pdf
>
>
> I insert a Excel file into the pdf file.
> But can not extracte embedded excel resources.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)