[ 
https://issues.apache.org/jira/browse/PDFBOX-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14347313#comment-14347313
 ] 

Tyler Palsulich commented on PDFBOX-1835:
-----------------------------------------

Good idea! I missed that comment above. Here it is:
{code}
        at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1560)
        at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1560)
        at org.apache.tika.parser.pdf.PDFParser.addMetadata(PDFParser.java:477)
        at org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:268)
        at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:144)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:270)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:270)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:153)
        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:450)
        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:123)
{code}

[PDFParser.java:477|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParser.java#L477] is where {{COSDictionary.toString()}} is called.
[This comment|https://issues.apache.org/jira/browse/PDFBOX-1525?focusedCommentId=13588884&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13588884] suggests not using {{COSDictionary.toString()}}. So, what is the preferred way?
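
For illustration only, here is a minimal sketch of the general idea of walking a self-referencing dictionary structure without recursing forever. It uses plain {{java.util.Map}} objects as a stand-in and is not PDFBox API; the class name {{CycleSafeDump}}, the {{dump}} method, and the {{<cycle>}} marker are all hypothetical. I am not claiming this is the preferred way in PDFBox, only that a guard like this avoids the unbounded recursion shown in the trace.

```java
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical helper: plain Maps stand in for nested dictionaries.
public class CycleSafeDump {

    /** Renders a nested map as text, emitting a marker instead of re-entering a map already being printed. */
    public static String dump(Map<String, Object> dict) {
        StringBuilder sb = new StringBuilder();
        // Identity-based set: tracks object instances without calling hashCode()
        // on a cyclic map, which could itself recurse without end.
        Set<Object> onPath = Collections.newSetFromMap(new IdentityHashMap<Object, Boolean>());
        dump(dict, onPath, sb);
        return sb.toString();
    }

    private static void dump(Object value, Set<Object> onPath, StringBuilder sb) {
        if (!(value instanceof Map)) {
            sb.append(value);
            return;
        }
        if (!onPath.add(value)) {
            // This map is already on the current path: stop here.
            sb.append("<cycle>");
            return;
        }
        sb.append('{');
        boolean first = true;
        for (Map.Entry<?, ?> e : ((Map<?, ?>) value).entrySet()) {
            if (!first) {
                sb.append(", ");
            }
            first = false;
            sb.append(e.getKey()).append(": ");
            dump(e.getValue(), onPath, sb);
        }
        sb.append('}');
        onPath.remove(value);
    }

    public static void main(String[] args) {
        Map<String, Object> page = new LinkedHashMap<>();
        Map<String, Object> parent = new LinkedHashMap<>();
        page.put("Type", "Page");
        page.put("Parent", parent);
        parent.put("Kids", page); // back-reference creates the cycle
        System.out.println(dump(page)); // prints {Type: Page, Parent: {Kids: <cycle>}}
    }
}
```

The set only holds the maps on the current path, so a dictionary that is referenced twice without forming a loop is still printed in full both times.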

> Stackoverflow pdfbox 1.8.3
> --------------------------
>
>                 Key: PDFBOX-1835
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1835
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.8.3
>         Environment: CentOS6
> Java 7
>            Reporter: Johnny Minty
>
> Hi guys, I'm using Apache Tika in an application which uses PDFBox for PDF 
> parsing.
> When trying to read in a specific PDF attachment, I'm receiving the following 
> stack overflow:
> Caused by: java.lang.StackOverflowError
>         at java.lang.String.valueOf(String.java:2854)
>         at java.lang.StringBuilder.append(StringBuilder.java:128)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1450)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>         at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)



