[
https://issues.apache.org/jira/browse/PDFBOX-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14075466#comment-14075466
]
ASF subversion and git services commented on PDFBOX-2245:
---------------------------------------------------------
Commit 1613708 from [~tilman] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1613708 ]
PDFBOX-2245: avoid exception for empty unicode string in
getGIDForCharacterCode()
> java.lang.StringIndexOutOfBoundsException in
> PDTrueTypeFont.getGIDForCharacterCode
> ----------------------------------------------------------------------------------
>
> Key: PDFBOX-2245
> URL: https://issues.apache.org/jira/browse/PDFBOX-2245
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.0
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Fix For: 2.0.0
>
> Attachments: PDFBOX-2245-052567.pdf
>
>
> I get this exception with the attached file:
> {code}
> java.lang.StringIndexOutOfBoundsException: String index out of range: 0
> at java.lang.String.codePointAt(Unknown Source)
> at
> org.apache.pdfbox.pdmodel.font.PDTrueTypeFont.getGIDForCharacterCode(PDTrueTypeFont.java:452)
> at
> org.apache.pdfbox.rendering.font.TTFGlyph2D.getGIDForCharacterCode(TTFGlyph2D.java:178)
> at
> org.apache.pdfbox.rendering.font.TTFGlyph2D.getPathForCharacterCode(TTFGlyph2D.java:150)
> at
> org.apache.pdfbox.rendering.PageDrawer.drawGlyph2D(PageDrawer.java:354)
> at
> org.apache.pdfbox.rendering.PageDrawer.processGlyph(PageDrawer.java:325)
> {code}
> The cause is that Encoding.getCharacterForName(characterName) returns an
> empty string. This is because characterName is ".notdef". Adobe Reader
> displays a gap. This can be seen at "Alumni Weekend" and "health
> professions". A copy & paste into an editor shows that Adobe returns a hex 1
> code. My change ignores the wrong code, outputs an error message and the file
> renders like in adobe reader.
--
This message was sent by Atlassian JIRA
(v6.2#6252)