[
https://issues.apache.org/jira/browse/PDFBOX-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13901299#comment-13901299
]
Henning Bermeyer commented on PDFBOX-1835:
------------------------------------------
The stack trace I posted is complete. since the StackOverflow-Exception falls
through unhandled through most of the wrapping code. It is caught and wrapped
into an IOException as soon as it reaches our code frame, but that part is not
of much use for finding the problem here.
What I can tell is that we are using PDFBox through Tika (PDFParser.parse) and
that it seems as if the PDFStreamParser creates a parseTree, of COSObjects that
either is either unrealistically deep or does not prevent circular references.
The PDF displays fine in Acrobat by the way and is only about 750kB in Size.
> Stackoverflow pdfbox 1.8.3
> --------------------------
>
> Key: PDFBOX-1835
> URL: https://issues.apache.org/jira/browse/PDFBOX-1835
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.8.3
> Environment: CentOS6
> Java 7
> Reporter: Johnny Minty
>
> I guys I'm using Apache Tika in an application which uses PDFbox for PDF
> parsing.
> When trying to read in a specific PDF attachment i'm receiving the following
> stack overflow:
> Caused by: java.lang.StackOverflowError
> at java.lang.String.valueOf(String.java:2854)
> at java.lang.StringBuilder.append(StringBuilder.java:128)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1450)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
> at
> org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)