[
https://issues.apache.org/jira/browse/PDFBOX-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13281500#comment-13281500
]
Michael McCandless commented on PDFBOX-1320:
--------------------------------------------
Good catch Sumuli! We can also null-check the return from the getNames()
method.
Tika's PDFParser has moved back to Tika sources (thanks Jukka!) ... I'll fix
this there.
But, separately, we should also fix ExtractText to null check the call to
embeddedFiles.getNames()....
> NPE in extractEmbeddedDocuments
> -------------------------------
>
> Key: PDFBOX-1320
> URL: https://issues.apache.org/jira/browse/PDFBOX-1320
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.7.0
> Environment: pdfbox 1.7.0 (current trunk)
> Reporter: Samuli Saarinen
> Attachments: PDNameTreeNode.java.patch
>
>
> While parsing a pdf document the following exception is thrown:
> java.lang.NullPointerException
> at
> org.apache.pdfbox.tika.PDFParser.extractEmbeddedDocuments(PDFParser.java:155)
> at org.apache.pdfbox.tika.PDFParser.parse(PDFParser.java:133)
> at test.TikaParse.main(TikaParse.java:27)
> The document I'm trying to parse is probably confidential so I cannot attach
> it until (or if) I get clearence.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira