[
https://issues.apache.org/jira/browse/PDFBOX-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17709980#comment-17709980
]
Andreas Lehmkühler commented on PDFBOX-5178:
--------------------------------------------
The pre-tests reveal some other minor regressions. The parser hits a malformed
array within a content stream when reading the attached file
[^GHOSTSCRIPT-699768-0.pdf] and throws an exception. My fix makes the parser
more lenient again by swallowing any exception which occurs when hitting a
malformed array or dictionary. The parser simply stops prematurely and returns
null.
@Thanks for your feedback
> Parsing differences between 2.0.23 and 2.0.24/3.0
> -------------------------------------------------
>
> Key: PDFBOX-5178
> URL: https://issues.apache.org/jira/browse/PDFBOX-5178
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.23, 3.0.0 PDFBox
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Priority: Major
> Fix For: 2.0.28, 3.0.0 PDFBox
>
> Attachments: GHOSTSCRIPT-699768-0.pdf, MOZILLA-1129855-0.pdf,
> poppler-704-0.pdf
>
>
> There are some weird differences in parsing the attached file, 2.0.23 shows
> "BigTIFF.tif" in the /Contents of the first annotation and a loop at
> Root/Pages/Kids/[0]/Annots/[0]/FS (always 14 0 R), while 3.0 doesn't have
> that, but doesn't have "BigTIFF.tif". I'm not sure which one (if any) is
> wrong.
>
> UPDATE
> 2.0.24 shows the same behaviour as 3.0
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]