[ 
https://issues.apache.org/jira/browse/PDFBOX-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655689#comment-17655689
 ] 

Andreas Lehmkühler commented on PDFBOX-5178:
--------------------------------------------

I've found a way to make PDFBox a little more lenient. The parser stumbled upon 
the invalid dictionary entry {{/F a8}} and returned {{null}} so that the parser 
stops reading the following dictionary values. In the end some valid entries 
are missing which explains the differences. Now the parser returns {{COSNull}} 
as value which is still invalid but the parser doesn't reading a the following 
values are read again.

[~tilman] Is there any chance to run your tests to see if there are any side 
effects? Thanks in advance


> Parsing differences between 2.0.23 and 2.0.24/3.0
> -------------------------------------------------
>
>                 Key: PDFBOX-5178
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5178
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.23, 3.0.0 PDFBox
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>         Attachments: poppler-704-0.pdf
>
>
> There are some weird differences in parsing the attached file, 2.0.23 shows 
> "BigTIFF.tif" in the /Contents of the first annotation and a loop at 
> Root/Pages/Kids/[0]/Annots/[0]/FS (always 14 0 R), while 3.0 doesn't have 
> that, but doesn't have "BigTIFF.tif". I'm not sure which one (if any) is 
> wrong.
>  
> UPDATE
> 2.0.24 shows the same behaviour as 3.0



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to