[ 
https://issues.apache.org/jira/browse/PDFBOX-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Leleu updated PDFBOX-1557:
-------------------------------

    Attachment: JIRA-1557.patch

Hi,

At first glance, some COSObjects aren't loaded.
Here is a patch that fixes the problem: at the end of the initialize method of the 
NonSeq parser, I check that all objects in the trailer are loaded.
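
To illustrate the idea only (this is not the attached patch), here is a minimal, 
self-contained sketch. Ref, Loader and loadUnresolved are hypothetical stand-ins, 
not PDFBox classes or methods: Ref plays the role of an indirect object whose 
target may not have been parsed yet, and Loader plays the role of whatever the 
parser uses to fetch an object by number. It assumes Java 8 or later for the 
lambda in main.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical model of "load every trailer object that is still unresolved".
// None of these types are PDFBox classes.
class TrailerLoadSketch {

    /** Stand-in for an indirect object; target stays null until it is parsed. */
    static final class Ref {
        final long objectNumber;
        Object target;

        Ref(long objectNumber) {
            this.objectNumber = objectNumber;
        }
    }

    /** Stand-in for the parser routine that reads one object by its number. */
    interface Loader {
        Object load(long objectNumber);
    }

    /**
     * Checks each direct trailer entry and loads those references that have
     * no target yet. Returns how many references were actually resolved.
     */
    static int loadUnresolved(Map<String, Object> trailer, Loader loader) {
        int loaded = 0;
        for (Object value : trailer.values()) {
            if (value instanceof Ref) {
                Ref ref = (Ref) value;
                if (ref.target == null) {
                    ref.target = loader.load(ref.objectNumber);
                    if (ref.target != null) {
                        loaded++;
                    }
                }
            }
        }
        return loaded;
    }

    public static void main(String[] args) {
        Map<String, Object> trailer = new LinkedHashMap<>();
        Ref root = new Ref(1);
        root.target = "catalog";      // already parsed
        Ref info = new Ref(5);        // left unresolved, as in this issue
        trailer.put("Root", root);
        trailer.put("Info", info);

        int loaded = loadUnresolved(trailer, number -> "object " + number);
        System.out.println(loaded + " reference(s) loaded, Info = " + info.target);
    }
}
```

The sketch only looks at direct trailer entries and leaves already-resolved 
references untouched, so running it a second time loads nothing.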

Maybe there is a better way to do this; I'm on it.

BR,
Eric

                
> NonSequentialPDFParser incorrectly parsing document info
> --------------------------------------------------------
>
>                 Key: PDFBOX-1557
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1557
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.0
>         Environment: Mac OS X 10.6.8, Eclipse Version: Juno Service Release 2 
> (Build id: 20130225-0426), Java SE 6 (1.6.0)
>            Reporter: Robert Bartlett-Schneider
>            Priority: Minor
>         Attachments: aa.pdf, JIRA-1557.patch, TestParsers.java
>
>
> When using the NonSequentialPDFParser, the PDDocumentInformation returned by 
> getDocumentInformation() seems to contain all null entries, which does not 
> occur when using the standard PDFParser. I have a large batch of PDF files 
> which have random and strange issues that cause them to occasionally fail 
> with the standard parser, so I was experimenting with the NonSequential 
> parser and came across this issue.
> I'll attempt to attach some test code & a test PDF file for which I can 
> replicate the issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
