[ https://issues.apache.org/jira/browse/PDFBOX-5283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418727#comment-17418727 ]
Michael Klink commented on PDFBOX-5283:
---------------------------------------

The PDF is broken. The second object that labels itself as object 8 (by its {{8 0 obj}} header) is, according to the cross references, object 9! Thus, the cross references and the objects themselves are at odds here.

In such a situation, different PDF processors may use different strategies in trying to read the PDF nonetheless. None of these strategies is right or wrong; for each strategy there are documents it repairs well and others it fails to repair. Even the strategy of rejecting the document has its merits: you cannot get fooled by dubious tricks.

> No Content - xRef / Obj Parsing
> -------------------------------
>
>                 Key: PDFBOX-5283
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5283
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Reporter: Oliver Schmidtmer
>            Priority: Major
>         Attachments: Lieferschein_110300.pdf
>
>
> There seems to be an issue with xRef / object reading when parsing the
> attached pdf.
> The PDF itself has for example two objects with the ref "8 0 R":
> One at position 1967 with a "/Content" entry.
> One at position 7782 without a "/Content" entry.
> Both are referenced in the XRef Table, so there seems to be something off.
> Probably Acrobat, etc. are using the first object, while PDFBox is using the
> second one.
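To illustrate the kind of mismatch described in the comment above, here is a minimal sketch in plain Java that checks whether each cross-reference entry points at an object header carrying the same object number. It is not PDFBox's parser and uses no PDFBox API; the names {{XrefConsistencyCheck}} and {{findMismatches}} are hypothetical, and it assumes the cross-reference table has already been read into a map from object number to byte offset (generation numbers are ignored for brevity).

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only; not part of PDFBox.
final class XrefConsistencyCheck {

    // Matches an indirect object header such as "8 0 obj".
    private static final Pattern OBJ_HEADER =
            Pattern.compile("(\\d+)\\s+(\\d+)\\s+obj");

    /**
     * Returns one message per xref entry whose offset does not start with
     * an "N G obj" header whose N equals the xref object number.
     * Assumption: xref maps object number -> byte offset.
     */
    static List<String> findMismatches(byte[] pdf, Map<Integer, Integer> xref) {
        // ISO-8859-1 maps one byte to one char, so byte offsets stay valid.
        String text = new String(pdf, StandardCharsets.ISO_8859_1);
        List<String> problems = new ArrayList<>();

        // TreeMap copy gives a stable, ascending report order.
        for (Map.Entry<Integer, Integer> e : new TreeMap<>(xref).entrySet()) {
            int objNum = e.getKey();
            int offset = e.getValue();

            if (offset < 0 || offset >= text.length()) {
                problems.add("object " + objNum + ": offset " + offset
                        + " lies outside the file");
                continue;
            }

            Matcher m = OBJ_HEADER.matcher(text);
            m.region(offset, text.length());
            if (!m.lookingAt()) {
                problems.add("object " + objNum + ": no object header at offset "
                        + offset);
            } else if (!m.group(1).equals(Integer.toString(objNum))) {
                problems.add("object " + objNum + ": header at offset " + offset
                        + " reads \"" + m.group(1) + " " + m.group(2) + " obj\"");
            }
        }
        return problems;
    }
}
```

A check like this only detects the inconsistency. Whether a processor then trusts the cross references, trusts the object headers, or rejects the file is exactly the strategy choice discussed in the comment.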