[ https://issues.apache.org/jira/browse/PDFBOX-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Eric Leleu updated PDFBOX-1557:
-------------------------------
Attachment: JIRA-1557.patch
Hi,
At first glance, some COSObjects aren't loaded.
Here is a patch that fixes the problem: at the end of the initialize method
of the NonSeq parser, I check that all objects in the trailer are loaded.
Maybe there is a better way to do this; I'm looking into it.
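
For illustration only, here is a minimal, self-contained sketch of that kind
of check. It is not the attached patch and it does not use PDFBox classes:
LazyObject, ObjectSource and loadAllTrailerObjects are hypothetical names
that stand in for an indirect object, the parsed file, and the extra step at
the end of initialization. The assumption is that a trailer entry such as
Info can still be an unresolved reference when parsing finishes.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

public class TrailerLoadSketch {

    /** Hypothetical stand-in for an indirect object; value stays null until loaded. */
    static final class LazyObject {
        final int objectNumber;
        Object value;

        LazyObject(int objectNumber) {
            this.objectNumber = objectNumber;
        }

        boolean isLoaded() {
            return value != null;
        }
    }

    /** Hypothetical stand-in for the file: maps object numbers to parsed content. */
    static final class ObjectSource {
        private final Map<Integer, Object> objects = new HashMap<>();

        void put(int objectNumber, Object content) {
            objects.put(objectNumber, content);
        }

        Object parse(int objectNumber) {
            return objects.get(objectNumber);
        }
    }

    /**
     * Walks the direct entries of the trailer and loads every lazy object
     * that is still unloaded. Returns the number of objects loaded here.
     */
    static int loadAllTrailerObjects(Map<String, Object> trailer, ObjectSource source) {
        int loaded = 0;
        for (Object entry : trailer.values()) {
            if (entry instanceof LazyObject) {
                LazyObject lazy = (LazyObject) entry;
                if (!lazy.isLoaded()) {
                    lazy.value = source.parse(lazy.objectNumber);
                    if (lazy.isLoaded()) {
                        loaded++;
                    }
                }
            }
        }
        return loaded;
    }

    public static void main(String[] args) {
        ObjectSource source = new ObjectSource();
        Map<String, String> info = new HashMap<>();
        info.put("Title", "Example");
        source.put(7, info);

        Map<String, Object> trailer = new LinkedHashMap<>();
        LazyObject infoRef = new LazyObject(7);
        trailer.put("Info", infoRef);

        // Before the check the reference is unresolved, so a caller would see nothing.
        System.out.println("loaded before check: " + infoRef.isLoaded());
        int count = loadAllTrailerObjects(trailer, source);
        System.out.println("objects loaded by check: " + count);
        System.out.println("loaded after check: " + infoRef.isLoaded());
    }
}
```

The sketch only visits direct trailer entries; whether nested references
also need resolving is left open here.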
BR,
Eric
> NonSequentialPDFParser incorrectly parsing document info
> --------------------------------------------------------
>
> Key: PDFBOX-1557
> URL: https://issues.apache.org/jira/browse/PDFBOX-1557
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.8.0
> Environment: Mac OS X 10.6.8, Eclipse Version: Juno Service Release 2
> (Build id: 20130225-0426), Java SE 6 (1.6.0)
> Reporter: Robert Bartlett-Schneider
> Priority: Minor
> Attachments: aa.pdf, JIRA-1557.patch, TestParsers.java
>
>
> When using the NonSequentialPDFParser, the PDDocumentInformation returned by
> getDocumentInformation() seems to contain all null entries, which does not
> occur when using the standard PDFParser. I have a large batch of PDF files
> which have random and strange issues that cause them to occasionally fail
> with the standard parser, so I was experimenting with the NonSequential
> parser and came across this issue.
> I'll attempt to attach some test code & a test PDF file for which I can
> replicate the issue.