[ 
https://issues.apache.org/jira/browse/PDFBOX-1761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13808062#comment-13808062
 ] 

William Palmer commented on PDFBOX-1761:
----------------------------------------

It looks like a duplicate although the beginning of the stack trace isn't 
identical.

With the most recent 2.0.0-SNAPSHOT from 
http://repository.apache.org/snapshots/ no exception is thrown, but no text 
is extracted from the PDF either.

> java.lang.StringIndexOutOfBoundsException: String index out of range: 2047
> --------------------------------------------------------------------------
>
>                 Key: PDFBOX-1761
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1761
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.2
>         Environment: JDK6
>            Reporter: William Palmer
>            Priority: Minor
>
> Running the code samples provided in PDFBOX-1757 with load() and 
> loadNonSeq() gives the following exception(s) for the test file:
>  - http://digitalcorpora.org/corp/nps/files/govdocs1/447/447403.pdf
>       java.lang.StringIndexOutOfBoundsException: String index out of range: 2047
>       at java.lang.AbstractStringBuilder.deleteCharAt(AbstractStringBuilder.java:770)
>       at java.lang.StringBuilder.deleteCharAt(StringBuilder.java:263)
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSHexString(BaseParser.java:1000)
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:808)
>       - using loadNonSeq():
>       java.lang.StringIndexOutOfBoundsException: String index out of range: 2047
>       at java.lang.AbstractStringBuilder.deleteCharAt(AbstractStringBuilder.java:770)
>       at java.lang.StringBuilder.deleteCharAt(StringBuilder.java:263)
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSHexString(BaseParser.java:1000)
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:808)
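
For reference, the innermost frames of the trace above are plain JDK calls: 
StringBuilder.deleteCharAt(int) throws StringIndexOutOfBoundsException when 
the index is negative or at/past the end of the buffer. The sketch below 
only illustrates that JDK behaviour; it does not reproduce what 
parseCOSHexString does internally. The class name DeleteCharAtDemo, the 
helper deleteThrows, and the assumption that the buffer held exactly 2047 
characters are all hypothetical.

```java
public class DeleteCharAtDemo {

    // Builds a StringBuilder of the given length and reports whether
    // deleteCharAt(index) throws StringIndexOutOfBoundsException.
    static boolean deleteThrows(int length, int index) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < length; i++) {
            sb.append('A');
        }
        try {
            sb.deleteCharAt(index);
            return false;
        } catch (StringIndexOutOfBoundsException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // Assumed scenario: index equals the buffer length (one past the end).
        System.out.println(deleteThrows(2047, 2047)); // prints true
        // The last valid index is length - 1, so this call succeeds.
        System.out.println(deleteThrows(2047, 2046)); // prints false
    }
}
```

If that is what happens here, the parser is asking to delete a character 
at a position the buffer does not contain, which would be consistent with 
an off-by-one or a stale index when handling the hex string in this file.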



--
This message was sent by Atlassian JIRA
(v6.1#6144)
