[jira] [Commented] (PDFBOX-5009) Corrupt PDF can lead to a StackOverflow

Tilman Hausherr (Jira) Sun, 08 Nov 2020 06:18:45 -0800


    [ 
https://issues.apache.org/jira/browse/PDFBOX-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17228004#comment-17228004
 ]


Tilman Hausherr commented on PDFBOX-5009:
-----------------------------------------

I thought I was getting a stack overflow with PDFDebugger but no, this was 
probably because of some local changes.

Doing PDPage.get() on such files can bring an unchecked exception. Still bad, 
but not as bad as a stack overflow. So I have documented it. Preventing it 
seems tricky and would require an API change. It should be done in a separate 
issue.

> Corrupt PDF can lead to a StackOverflow
> ---------------------------------------
>
>                 Key: PDFBOX-5009
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5009
>             Project: PDFBox
>          Issue Type: Task
>          Components: Text extraction
>    Affects Versions: 2.0.21
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.0.22, 3.0.0 PDFBox
>
>
> See TIKA-3224.  I confirmed this with 2.0.21 by calling the app's ExtractText 
> on the file posted on the Tika issue.
> cc [~dadoonet]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (PDFBOX-5009) Corrupt PDF can lead to a StackOverflow

Reply via email to