[ https://issues.apache.org/jira/browse/PDFBOX-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934067#action_12934067 ]
Adam Nichols commented on PDFBOX-897:
-------------------------------------
I'm looking at PDFont and the only place I see:

    encoding.addCharacterEncoding(index, name.replace("/", ""));

is on line 832, which is within this block:

    else if (line.startsWith("dup")) {

Since your line doesn't begin with "dup", I'm not sure why this line would
ever be executed, nor how your patch would fix anything. Are you using the
latest version of PDFBox? Maybe someone changed this between the version you
are using and the one I'm looking at. I'm looking at the latest code from SVN.

> NullPointerException PDFFont#getEncodingFromFont with a PDF book because
> Type1Encoding is null
> ----------------------------------------------------------------------------------------------
>
> Key: PDFBOX-897
> URL: https://issues.apache.org/jira/browse/PDFBOX-897
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.3.1
> Reporter: Martijn Brinkers
> Attachments: PDFBOX-897.patch
>
>
> A NullPointerException was thrown while extracting text from a PDF ebook. The
> exception was thrown in
> PDFFont#getEncodingFromFont line:
> [snip]
> encoding.addCharacterEncoding(index, name.replace("/", ""));
> [snip]
> encoding was null. The line being scanned was "/Encoding 256 array 0 1 255
> {1 index exch /.notdef put} for". The array check, however, only tests
> line.endsWith("array"), so it does not match this line. Using
> line.contains("array") instead fixed the NPE.
> I have added a patch. The PDF is a copyrighted book, so it cannot be
> attached as an example. The metadata of the document was:
> Acrobat Distiller 7.0 (Windows)
> PScript5.dll Version 5.2.2
> PDF-1.6
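
For reference, the string check described in the report can be reproduced in
isolation. The sketch below is not PDFBox code: the class and method names are
hypothetical, and it assumes the quoted line is a single line that was only
wrapped by the email. It shows just how the two String tests behave on that
input, not whether the change addresses the NPE.

```java
public class ArrayCheckDemo {
    // The scanned line quoted in the report, joined back into one line (assumption).
    static final String REPORTED_LINE =
        "/Encoding 256 array 0 1 255 {1 index exch /.notdef put} for";

    // The check the report describes as the current behavior.
    static boolean matchesEndsWith(String line) {
        return line.endsWith("array");
    }

    // The check the report describes as the patched behavior.
    static boolean matchesContains(String line) {
        return line.contains("array");
    }

    public static void main(String[] args) {
        // The reported line ends with "for", so only contains() matches it.
        System.out.println("endsWith: " + matchesEndsWith(REPORTED_LINE));
        System.out.println("contains: " + matchesContains(REPORTED_LINE));
        // A declaration that stops right after "array" matches both checks.
        System.out.println("short form: " + matchesEndsWith("/Encoding 256 array"));
    }
}
```

On the reported line, endsWith("array") is false and contains("array") is
true, which matches the behavior described in the report.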