[ https://issues.apache.org/jira/browse/PDFBOX-3398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15350209#comment-15350209 ]
Tilman Hausherr commented on PDFBOX-3398:
-----------------------------------------
I doubt that. It is still a complex structure.
Btw the text is here:
Root/Pages/Kids/[0]/Contents
You'll need to understand the operators, see "operator summary" in the PDF spec.
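To make the operator hint a bit more concrete, here is a minimal sketch in plain Java (it does not use PDFBox). The class name ContentStreamSketch, its two helper methods, and the sample content stream are made up for illustration. It assumes the content stream has already been decoded to text, and it only handles the Tj (show text) operator with a simple literal string inside a BT ... ET text object: no escape sequences or nested parentheses, no hex strings, no TJ arrays, and no font encoding.

```java
import java.util.ArrayList;
import java.util.List;

public class ContentStreamSketch {

    /** Splits a decoded content stream into tokens; a (literal string) stays one token. */
    static List<String> tokenize(String content) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i < content.length()) {
            char c = content.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;
            } else if (c == '(') {
                // Simplification: the string ends at the next ')'.
                int end = content.indexOf(')', i);
                if (end < 0) {
                    throw new IllegalArgumentException("unterminated string at " + i);
                }
                tokens.add(content.substring(i, end + 1));
                i = end + 1;
            } else {
                // Names, numbers and operators run until whitespace or '('.
                int start = i;
                while (i < content.length()
                        && !Character.isWhitespace(content.charAt(i))
                        && content.charAt(i) != '(') {
                    i++;
                }
                tokens.add(content.substring(start, i));
            }
        }
        return tokens;
    }

    /** Collects the string operand of every Tj operator found between BT and ET. */
    static List<String> shownText(String content) {
        List<String> shown = new ArrayList<>();
        List<String> tokens = tokenize(content);
        boolean inText = false;
        for (int k = 0; k < tokens.size(); k++) {
            String t = tokens.get(k);
            if (t.equals("BT")) {
                inText = true;           // begin text object
            } else if (t.equals("ET")) {
                inText = false;          // end text object
            } else if (t.equals("Tj") && inText && k > 0) {
                // Operands precede their operator (postfix notation).
                String operand = tokens.get(k - 1);
                if (operand.startsWith("(") && operand.endsWith(")")) {
                    shown.add(operand.substring(1, operand.length() - 1));
                }
            }
        }
        return shown;
    }

    public static void main(String[] args) {
        // Hypothetical content stream: select font, move to a position, show text.
        String content = "BT /F1 12 Tf 72 712 Td (Hello World) Tj ET";
        System.out.println(shownText(content)); // prints [Hello World]
    }
}
```

In the sample, Tf selects the font and size, Td moves the text position, and Tj shows the string. Real content streams are usually compressed and use many more operators, so treat this only as a reading aid for the "operator summary" table.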
> Text (XML) output of pdf structure
> ----------------------------------
>
> Key: PDFBOX-3398
> URL: https://issues.apache.org/jira/browse/PDFBOX-3398
> Project: PDFBox
> Issue Type: New Feature
> Components: Parsing, Utilities
> Reporter: Stefan Hegny
> Priority: Minor
>
> It would be nice to have a text/XML output of a PDF file's entire document
> structure, as can be browsed in the debugger window GUI. It would allow for
> easier searching and understanding of the structure. Not sure if it should
> be an option to PDFReader/PDFDebugger or a separate class that might also be
> bundled into an app jar. I would even start working on it, given the
> preferred base to start on.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)