[
https://issues.apache.org/jira/browse/PDFBOX-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228004#comment-17228004
]
Tilman Hausherr commented on PDFBOX-5009:
-----------------------------------------
I thought I was getting a stack overflow with PDFDebugger but no, this was
probably because of some local changes.
Doing PDPage.get() on such files can bring an unchecked exception. Still bad,
but not as bad as a stack overflow. So I have documented it. Preventing it
seems tricky and would require an API change. It should be done in a separate
issue.
> Corrupt PDF can lead to a StackOverflow
> ---------------------------------------
>
> Key: PDFBOX-5009
> URL: https://issues.apache.org/jira/browse/PDFBOX-5009
> Project: PDFBox
> Issue Type: Task
> Components: Text extraction
> Affects Versions: 2.0.21
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>
> See TIKA-3224. I confirmed this with 2.0.21 by calling the app's ExtractText
> on the file posted on the Tika issue.
> cc [~dadoonet]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]