[jira] [Commented] (PDFBOX-5283) No Content - xRef / Obj Parsing

Michael Klink (Jira) Wed, 22 Sep 2021 10:23:09 -0700


    [ 
https://issues.apache.org/jira/browse/PDFBOX-5283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418727#comment-17418727
 ]


Michael Klink commented on PDFBOX-5283:
---------------------------------------

The PDF is broken.

The second object designated as object 8 by a {{8 0 obj}} according to the 
cross references is object 9!

Thus, here the cross references and the objects themselves are at odds. In such 
a situation different PDF processors may use different strategies in trying to 
read the PDF nonetheless. None of these strategies is right or wrong, for each 
strategy there are documents it repairs well and others it fails to repair. 
Even the strategy to reject the document has its merits: You cannot get fooled 
by dubious tricks.

> No Content - xRef / Obj Parsing
> -------------------------------
>
>                 Key: PDFBOX-5283
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5283
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Reporter: Oliver Schmidtmer
>            Priority: Major
>         Attachments: Lieferschein_110300.pdf
>
>
> There seems to be an issue with xRef / object reading when parsing the 
> attached pdf.
> The PDF itself has for example two objects with the ref "8 0 R":
> One at position 1967 with a "/Content" entry.
> One at position 7782 without a "/Content" entry.
> Both are referenced in the XRef Table, so there seems to be something off. 
> Probably Acrobat, etc. are using the first object, while PDFBox is using the 
> second one.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (PDFBOX-5283) No Content - xRef / Obj Parsing

Reply via email to