[jira] [Commented] (PDFBOX-2393) PDF embeded with document can not parse.

Tilman Hausherr (JIRA) Sun, 28 Sep 2014 23:47:00 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151414#comment-14151414
 ]


Tilman Hausherr commented on PDFBOX-2393:
-----------------------------------------

[~sunxingzhe359] then I wonder why you didn't open an issue with TIKA...

[[email protected]] does TIKA extract PDF attachments from annotations? The 
code should probably be somewhat like this:

https://mail-archives.apache.org/mod_mbox/pdfbox-users/201306.mbox/%[email protected]%3E

List annotations = page.getAnnotations();

The one you are looking for has to be an instance of the class
org.apache.pdfbox.pdmodel.interactive.annotation.PDAnnotationFileAttachment.

> PDF embeded with document can not parse.
> ----------------------------------------
>
>                 Key: PDFBOX-2393
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2393
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.8.6
>         Environment: Windows7
>            Reporter: sunxingzhe
>         Attachments: embedded.pdf
>
>
> I insert a Excel file  into the pdf file.
> But  can not extracte embedded excel resources.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (PDFBOX-2393) PDF embeded with document can not parse.

Reply via email to