[ 
https://issues.apache.org/jira/browse/PDFBOX-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17514121#comment-17514121
 ] 

Oliver Schmidtmer commented on PDFBOX-5400:
-------------------------------------------

Found the difference between trunk and 2.0.

In both branches, if a referenced object is missing in the xref table, the 
object should be read from a brute force object list.

In 2.0 this is not done, because here 
[https://github.com/apache/pdfbox/blob/2.0/pdfbox/src/main/java/org/apache/pdfbox/pdfparser/COSParser.java#L668]
 bfSearchCOSObjectKeyOffsets is null as the brute force search has not jet been 
done.

 

> Page tree root must be a dictionary
> -----------------------------------
>
>                 Key: PDFBOX-5400
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5400
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.26
>            Reporter: Tilman Hausherr
>            Priority: Major
>              Labels: regression
>         Attachments: 4ECBGZDM5GUZG7UT75RV5GTUFWF5TSXK.pdf
>
>
> worked in 2.0.25
> {noformat}
> Caused by: java.io.IOException: Page tree root must be a dictionary
>     org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
>     org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
>     org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1107)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to