adam, im sorry. i neither dont know what program has been used nor do i know the password or how to remove the encryption. i only can ask some other people about this. i will open a jira issue and attach the file.
best regards reinhard [email protected] schrieb: > Reinhard, > > The root element in your PDF references object 1554 as the object which > informs us of the pages within this document. This object does not seem > to exist in the PDF, which is a violation of the PDF spec and why PDFBox > is unable to parse it. You can open the PDF in a decent text editor and > search for 1554 and you'll see the Pages section which references this > object, but that's the only place it's found, there's no object > definition. > > Now, having said that, if we can find a reliable way to parse files like > these, we can update the code. Do you know what program was used to > create this PDF? Would it be possible for you to remove the encryption on > this file and try it again? That would make it much easier to debug (if > it still crashes without the encryption, it might not). > > I also encourage you to create an issue of JIRA and upload this file there > (in case the link dies in the future). https://issues.apache.org/jira > > ---- > Thanks, > Adam > > > > > > From: > reinhard schwab <[email protected]> > To: > [email protected] > Date: > 08/21/2010 11:42 > Subject: > NPE in PDPageNode > > > > i get a nullpointer exception when parsing a pdf with tika. > > http://www.awsg.at/portal/media/4218.pdf > > java.lang.NullPointerException > at org.apache.pdfbox.pdmodel.PDPageNode.getCount(PDPageNode.java:109) > at > org.apache.pdfbox.pdmodel.PDDocument.getNumberOfPages(PDDocument.java:943) > at > org.apache.tika.parser.pdf.PDFParser.extractMetadata(PDFParser.java:105) > at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:86) > > > regards > reinhard > > > > > > > ? Click here to submit conditions > > This email and any content within or attached hereto from Sun West Mortgage > Company, Inc. is confidential and/or legally privileged. The information is > intended only for the use of the individual or entity named on this email. If > you are not the intended recipient, you are hereby notified that any > disclosure, copying, distribution or the taking of any action in reliance on > the contents of this email information is strictly prohibited, and that the > documents should be returned to this office immediately by email. Receipt by > anyone other than the intended recipient is not a waiver of any privilege. > Please do not include your social security number, account number, or any > other personal or financial information in the content of the email. Should > you have any questions, please call (800) 453 7884.
