[ https://issues.apache.org/jira/browse/PDFBOX-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13624856#comment-13624856 ]
Andreas Lehmkühler commented on PDFBOX-1557:
--------------------------------------------
As long as PDFBox doesn't load objects on demand, this seems to be the only
possible way to do this.
@Eric
If you don't mind, I'll commit the patch as proposed so that it'll be part of
the 1.8.1 release.
> NonSequentialPDFParser incorrectly parsing document info
> --------------------------------------------------------
>
> Key: PDFBOX-1557
> URL: https://issues.apache.org/jira/browse/PDFBOX-1557
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.8.0
> Environment: Mac OS X 10.6.8, Eclipse Version: Juno Service Release 2
> (Build id: 20130225-0426), Java SE 6 (1.6.0)
> Reporter: Robert Bartlett-Schneider
> Priority: Minor
> Attachments: aa.pdf, JIRA-1557.patch, TestParsers.java
>
>
> When using the NonSequentialPDFParser, the PDDocumentInformation returned by
> getDocumentInformation() seems to contain only null entries, which does not
> occur when using the standard PDFParser. I have a large batch of PDF files
> with assorted problems that occasionally cause the standard parser to fail,
> so I was experimenting with the NonSequentialPDFParser and came across this
> issue.
> I'll attempt to attach some test code and a test PDF file with which I can
> reproduce the issue.