[
https://issues.apache.org/jira/browse/PDFBOX-897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adam Nichols resolved PDFBOX-897.
---------------------------------
Resolution: Fixed
Fix Version/s: 1.4.0
Assignee: Adam Nichols
Thanks for the extra info, now I understand how it can be null. I committed
your patch and added an extra protection to avoid a NPE in case there's any
exceptional case (i.e. a non-conforming, and/or corrupt PDF). I also added
some logging info so if there so if we ever come across one of these
exceptional cases, it will at least be logged so users can track it down.
Committed in revision 1037891.
> NullPointerException PDFFont#getEncodingFromFont with a PDF book because
> Type1Encoding is null
> ----------------------------------------------------------------------------------------------
>
> Key: PDFBOX-897
> URL: https://issues.apache.org/jira/browse/PDFBOX-897
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.3.1
> Reporter: Martijn Brinkers
> Assignee: Adam Nichols
> Fix For: 1.4.0
>
> Attachments: PDFBOX-897.patch
>
>
> A NullPointerException was thrown while extracting text from a PDF ebook. The
> exception was thrown in
> PDFFont#getEncodingFromFont line:
> [snip]
> encoding.addCharacterEncoding(index, name.replace("/", ""));
> [snip]
> encoding was null. The line that was scanned was "/Encoding 256 array 0 1 255
> {1 index exch /.notdef put} for". The array check however only checks for
> line.endsWith("array"). The NPE was fixed when using line.contains("array")
> instead.
> I have added a patch. The PDF is a PDF book with copyright so it cannot be
> attached as an example. The meta data of the document was:
> Acrobat Distiller 7.0 (Windows)
> PScript5.dll Version 5.2.2
> PDF-1.6
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.