[
https://issues.apache.org/jira/browse/PDFBOX-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655689#comment-17655689
]
Andreas Lehmkühler commented on PDFBOX-5178:
--------------------------------------------
I've found a way to make PDFBox a little more lenient. The parser stumbled over
the invalid dictionary entry {{/F a8}} and returned {{null}}, which made it
stop reading the remaining dictionary values. As a result some valid entries
were missing, which explains the differences. Now the parser returns {{COSNull}}
as the value, which is still invalid, but parsing no longer stops and the
following values are read again.
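To illustrate the idea, here is a minimal, self-contained sketch. It is not the actual PDFBox code: the class {{LenientDictSketch}}, the methods {{parseValue}} and {{parseDictionary}}, the {{NULL_PLACEHOLDER}} object (a stand-in for {{COSNull}}) and the token list are all hypothetical, and the position of {{/Contents}} after {{/F a8}} is an assumption made only for the example.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class LenientDictSketch {
    // Hypothetical stand-in for COSNull: marks a value that could not be parsed.
    static final Object NULL_PLACEHOLDER = new Object() {
        @Override
        public String toString() {
            return "COSNull";
        }
    };

    // Toy value parser: returns null for any token it cannot interpret.
    static Object parseValue(String token) {
        if (token.matches("-?\\d+")) {
            return Integer.valueOf(token);                      // integer
        }
        if (token.length() >= 2 && token.startsWith("(") && token.endsWith(")")) {
            return token.substring(1, token.length() - 1);      // literal string
        }
        if (token.startsWith("/")) {
            return token;                                       // name
        }
        return null;                                            // invalid value, e.g. "a8"
    }

    // Reads key/value pairs. In strict mode an invalid value ends the loop,
    // so later entries are lost. In lenient mode the invalid value is replaced
    // by the placeholder and reading continues.
    static Map<String, Object> parseDictionary(List<String> tokens, boolean lenient) {
        Map<String, Object> dict = new LinkedHashMap<>();
        for (int i = 0; i + 1 < tokens.size(); i += 2) {
            String key = tokens.get(i);
            Object value = parseValue(tokens.get(i + 1));
            if (value == null) {
                if (!lenient) {
                    break;
                }
                value = NULL_PLACEHOLDER;
            }
            dict.put(key, value);
        }
        return dict;
    }

    public static void main(String[] args) {
        // Assumed token order, for illustration only.
        List<String> tokens = List.of(
                "/Type", "/Annot", "/F", "a8", "/Contents", "(BigTIFF.tif)");
        System.out.println("strict:  " + parseDictionary(tokens, false));
        System.out.println("lenient: " + parseDictionary(tokens, true));
    }
}
```

In the strict run the entry after {{/F}} is dropped, while in the lenient run {{/F}} maps to the placeholder and {{/Contents}} is still read.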
[~tilman] Is there any chance you could run your tests to see whether there are
any side effects? Thanks in advance.
> Parsing differences between 2.0.23 and 2.0.24/3.0
> -------------------------------------------------
>
> Key: PDFBOX-5178
> URL: https://issues.apache.org/jira/browse/PDFBOX-5178
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.23, 3.0.0 PDFBox
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Priority: Major
> Attachments: poppler-704-0.pdf
>
>
> There are some weird differences in parsing the attached file: 2.0.23 shows
> "BigTIFF.tif" in the /Contents of the first annotation and a loop at
> Root/Pages/Kids/[0]/Annots/[0]/FS (always 14 0 R), while 3.0 has neither
> the loop nor "BigTIFF.tif". I'm not sure which one (if any) is
> wrong.
>
> UPDATE
> 2.0.24 shows the same behaviour as 3.0
--
This message was sent by Atlassian Jira
(v8.20.10#820010)