[
https://issues.apache.org/jira/browse/PDFBOX-5551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17736746#comment-17736746
]
Andreas Lehmkühler commented on PDFBOX-5551:
--------------------------------------------
The content stream in question consists of several substreams and one of them
is malformed. I've made the code for content streams more lenient, so that in
such cases an empty content (sub)stream is returned. Now, the given pdf is
rendered without any exception, but more less of the content is missing.
> FoxHexOne Mutation PDF crashes both PDFBox 2.0.27 and 3.0.0.alpha3
> ------------------------------------------------------------------
>
> Key: PDFBOX-5551
> URL: https://issues.apache.org/jira/browse/PDFBOX-5551
> Project: PDFBox
> Issue Type: Bug
> Components: Utilities
> Affects Versions: 2.0.27, 3.0.0 PDFBox
> Environment: Windows 11 x64
> Reporter: Peter Wyatt
> Assignee: Andreas Lehmkühler
> Priority: Major
> Fix For: 3.0.0 PDFBox
>
> Attachments: file1114.pdf, image-2022-12-08-16-47-01-104.png,
> image-2022-12-08-16-49-10-816.png
>
>
> PDFBox Debugger 2.0.27 and 3.0.0.alpha3 both crash with
> {{java.util.concurrent.ExecutionException: java.io.IOException:
> java.util.zip.DataFormatException: invalid distance too far back}} while
> attempting to open FoxHexOne Mutation {{file1114.pdf}} (see
> [https://github.com/pdf-association/pdf-corpora#foxhex0ne-mutations]). In the
> PDFBox Debugger window, the Page tree is populated with pages 1-10.
> Yes, this is somehow a bad file, but I was hoping to find out why.
>
> PDFBox Debugger 2.0.27:
> !image-2022-12-08-16-47-01-104.png|width=510,height=463!
> PDFBox 3.3.0.alpha:
> !image-2022-12-08-16-49-10-816.png|width=531,height=485!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]